Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
gpandrad
/
qwen-2.5-3b-r1-countdown
like
0
Text Generation
Transformers
TensorBoard
Safetensors
qwen2
Generated from Trainer
trl
grpo
conversational
text-generation-inference
Inference Endpoints
arxiv:
2402.03300
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
main
qwen-2.5-3b-r1-countdown
Commit History
Model save
65dca1c
verified
gpandrad
commited on
5 days ago
Training in progress, step 451
021e030
verified
gpandrad
commited on
5 days ago
Model save
42558de
verified
gpandrad
commited on
6 days ago
Training in progress, step 450
b350eda
verified
gpandrad
commited on
6 days ago
Training in progress, step 425
709ae38
verified
gpandrad
commited on
6 days ago
Training in progress, step 400
6e2a776
verified
gpandrad
commited on
6 days ago
Training in progress, step 375
29f8858
verified
gpandrad
commited on
6 days ago
Training in progress, step 350
6b8a159
verified
gpandrad
commited on
6 days ago
Training in progress, step 325
a09df17
verified
gpandrad
commited on
6 days ago
Training in progress, step 300
fabe68d
verified
gpandrad
commited on
6 days ago
Training in progress, step 275
459d82d
verified
gpandrad
commited on
6 days ago
Training in progress, step 250
487c029
verified
gpandrad
commited on
6 days ago
Training in progress, step 225
8ff4e40
verified
gpandrad
commited on
6 days ago
Training in progress, step 200
3caafa0
verified
gpandrad
commited on
6 days ago
Training in progress, step 175
973895b
verified
gpandrad
commited on
6 days ago
Training in progress, step 150
bbed140
verified
gpandrad
commited on
6 days ago
Training in progress, step 125
ec31394
verified
gpandrad
commited on
6 days ago
Training in progress, step 100
f47f8da
verified
gpandrad
commited on
6 days ago
Training in progress, step 75
14c2349
verified
gpandrad
commited on
6 days ago
Training in progress, step 50
8cb9a1f
verified
gpandrad
commited on
6 days ago
Training in progress, step 25
ce53d48
verified
gpandrad
commited on
6 days ago
initial commit
2d2cee0
verified
gpandrad
commited on
6 days ago