Models: 115

Active filters: grpo

mgaimm/qwen-2.5-3b-r1-countdown

Text Generation • Updated 5 days ago • 10

tuyentx/qwen-2.5-3b-r1-countdown

Text Generation • Updated 5 days ago • 9

spinech/qwen2.5-7b-r1-rearc-stage2

Text Generation • Updated 5 days ago • 40

pablo-chocobar/qwen-2.5-3b-r1-countdown

Text Generation • Updated 3 days ago • 7

mradermacher/Qwen2.5-1.5B-Open-R1-GRPO-GGUF

Updated 4 days ago • 318

Julian-Sheeper/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • Updated 4 days ago • 5

pullpull/qwen-2.5-3b-r1-countdown

Text Generation • Updated 4 days ago • 2

justinj92/Qwen2.5-1.5B-Thinking-Q8_0-GGUF

Updated 4 days ago • 22

justinj92/Qwen2.5-1.5B-Thinking-Q5_K_M-GGUF

Updated 4 days ago • 21

spinech/qwen2.5-3b-r1-arc-train

Text Generation • Updated 3 days ago • 112

howardzhou/Qwen2.5-3B-Open-R1-GRPO

Text Generation • Updated 1 day ago • 7

jainamit/qwen-2.5-3b-r1-countdown

Text Generation • Updated about 5 hours ago • 2

GitBag/Qwen2.5-1.5B-Open-R1-GRPO

Text Generation • Updated 3 days ago • 2

justinj92/Qwen2.5-1.5B-Thinking-v1.1-Q8_0-GGUF

Updated 3 days ago • 3

justinj92/Qwen2.5-1.5B-Thinking-v1.1-Q5_K_M-GGUF

Updated 3 days ago • 19

Dongwei/Qwen-2.5-7B

Text Generation • Updated 4 days ago • 6

emre/Qwen-0.5B-GRPO

Text Generation • Updated 3 days ago • 40

peulsilva/reasoning-qwen-epoch0

Text Generation • Updated 3 days ago • 4

peulsilva/reasoning-qwen-epoch1

Text Generation • Updated 3 days ago • 6

spinech/qwen2.5-3b-r1-arc-train-synthetic

Text Generation • Updated 2 days ago • 28

peulsilva/reasoning-qwen-epoch2

Text Generation • Updated 3 days ago • 4

laolaorkk/Qwen2.5-1.5B-R1-GRPO

Text Generation • Updated about 8 hours ago • 6

Dongwei/DeepSeek-R1-Distill-Qwen-7B-GRPO_Math

Text Generation • Updated 3 days ago • 26

Dongwei/Qwen-2.5-7B_Math

Text Generation • Updated 3 days ago • 20

Dongwei/Qwen2.5-1.5B-Open-R1-GRPO_Math

Text Generation • Updated 3 days ago • 14

Dongwei/DeepSeek-R1-Distill-Qwen-1.5B-GRPO_Math

Text Generation • Updated 3 days ago • 16

peulsilva/reasoning-qwen-epoch3

Text Generation • Updated 3 days ago • 4

mradermacher/DeepSeek-R1-Distill-Qwen-7B-GRPO-GGUF

Updated 3 days ago • 1.29k

AndreasX1206/Qwen2-0.5B-countdown

Text Generation • Updated 3 days ago • 6

mradermacher/Qwen-0.5B-GRPO-GGUF

Updated 3 days ago • 254