Update README.md
Browse files
README.md
CHANGED
@@ -27,7 +27,7 @@ base_model:
|
|
27 |
## Training:
|
28 |
|
29 |
QwQ-0.5B-Distilled was trained using the **QwQ-LongCoT-130K dataset**, a carefully curated collection of long-context examples designed for reasoning and conversational AI tasks. The GKD framework ensures that the student model mimics the teacher modelβs outputs, aligning its predictions with high-quality responses.
|
30 |
-
### Training Progress
|
31 |
[βββββββββββ] 10%
|
32 |
|
33 |
### Training Script:
|
|
|
27 |
## Training:
|
28 |
|
29 |
QwQ-0.5B-Distilled was trained using the **QwQ-LongCoT-130K dataset**, a carefully curated collection of long-context examples designed for reasoning and conversational AI tasks. The GKD framework ensures that the student model mimics the teacher modelβs outputs, aligning its predictions with high-quality responses.
|
30 |
+
### Training Progress:
|
31 |
[βββββββββββ] 10%
|
32 |
|
33 |
### Training Script:
|