kz919/QwQ-0.5B-Distilled


kz919 committed on Dec 28, 2024

Commit e4dbff9 · verified · 1 Parent(s): dc41ba3

Update README.md

Files changed (1)

README.md +1 -1


@@ -27,7 +27,7 @@ base_model:
 ## Training:
 QwQ-0.5B-Distilled was trained using the **QwQ-LongCoT-130K dataset**, a carefully curated collection of long-context examples designed for reasoning and conversational AI tasks. The GKD framework ensures that the student model mimics the teacher model’s outputs, aligning its predictions with high-quality responses.
-### Training Progress
+### Training Progress:
 [▓░░░░░░░░░░] 10%
 ### Training Script: