spinech/qwen-2.5-3b-r1-countdown
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:2402.03300
main
qwen-2.5-3b-r1-countdown/model-00002-of-00002.safetensors
Commit History
deb5732 Training in progress, step 450 (verified, spinech committed 12 days ago)
af5da4c Training in progress, step 425 (verified, spinech committed 12 days ago)
2488a3c Training in progress, step 400 (verified, spinech committed 12 days ago)
e3d6b43 Training in progress, step 375 (verified, spinech committed 12 days ago)
5442796 Training in progress, step 350 (verified, spinech committed 12 days ago)
eba3da1 Training in progress, step 325 (verified, spinech committed 12 days ago)
8fa386b Training in progress, step 300 (verified, spinech committed 12 days ago)
01fa27e Training in progress, step 275 (verified, spinech committed 12 days ago)
79df401 Training in progress, step 250 (verified, spinech committed 12 days ago)
4ae5596 Training in progress, step 225 (verified, spinech committed 12 days ago)
87f57a8 Training in progress, step 200 (verified, spinech committed 12 days ago)
79dca64 Training in progress, step 175 (verified, spinech committed 12 days ago)
16929c4 Training in progress, step 150 (verified, spinech committed 12 days ago)
de2d8d8 Training in progress, step 125 (verified, spinech committed 12 days ago)
46ed6c4 Training in progress, step 100 (verified, spinech committed 12 days ago)
cadda9a Training in progress, step 75 (verified, spinech committed 12 days ago)
97b1c8a Training in progress, step 50 (verified, spinech committed 12 days ago)
d468990 Training in progress, step 25 (verified, spinech committed 12 days ago)