tiny-reasoning LLM trained via RL for reasoning tasks. CaptainHPY/Qwen2.5-7B-R1-Zero Text Generation • 5B • Updated Sep 16 • 11 CaptainHPY/Qwen2.5-7B-R1 Text Generation • 5B • Updated Sep 17 • 14 CaptainHPY/Qwen2.5-7B-R1-Zero-GGUF Text Generation • 8B • Updated Sep 17 • 51 CaptainHPY/Qwen2.5-7B-R1-GGUF Text Generation • 8B • Updated Sep 18 • 91
tiny-reasoning LLM trained via RL for reasoning tasks. CaptainHPY/Qwen2.5-7B-R1-Zero Text Generation • 5B • Updated Sep 16 • 11 CaptainHPY/Qwen2.5-7B-R1 Text Generation • 5B • Updated Sep 17 • 14 CaptainHPY/Qwen2.5-7B-R1-Zero-GGUF Text Generation • 8B • Updated Sep 17 • 51 CaptainHPY/Qwen2.5-7B-R1-GGUF Text Generation • 8B • Updated Sep 18 • 91