micaebe/Qwen2.5-1.5B-Instruct-QwQ

micaebe committed 18 days ago

Commit f6325e0 • 1 Parent(s): 7892031

Update README.md

Files changed (1)

README.md +1 -1

@@ -30,7 +30,7 @@ model-index:
 ## Introduction
-Qwen2.5-QwQ is a fine-tuned model based on Qwen2.5-1.5B-Instruct. It was fine-tuned on roughly 20k samples from QwQ-32B-Preview. Compared to Qwen2.5-1.5B-Instruct, this fine-tuned model seems more performant in mathematics contexts and general reasoning. Also it shows some capabilities of self-correction, altough it seems a bit limited because of the size (bigger models seem to learn self-correction more easily, e.g. the 3B & 7B version show much better self-correction abilities).
+Qwen2.5-1.5B-Instruct-QwQ is a fine-tuned model based on Qwen2.5-1.5B-Instruct. It was fine-tuned on roughly 20k samples from QwQ-32B-Preview. Compared to Qwen2.5-1.5B-Instruct, this fine-tuned model seems more performant in mathematics contexts and general reasoning. Also it shows some capabilities of self-correction, altough it seems a bit limited because of the size (bigger models seem to learn self-correction more easily, e.g. the 3B & 7B version show much better self-correction abilities).
 **This repo contains the instruction-tuned 1.5B Qwen2.5 model fine-tuned on QwQ reasoning chains**, which has the following features:
 - Type: Causal Language Models