lewtun/qwen2-0.5B-lr-3e-6
Tags: Safetensors, qwen2, trl, online-dpo, Generated from Trainer
Branch: main
File: qwen2-0.5B-lr-3e-6/model.safetensors
Commit History
bf35de4 (verified): Training in progress, step 977. lewtun (HF staff) committed on Aug 25
76db3e8 (verified): Training in progress, step 800. lewtun (HF staff) committed on Aug 24
72e44ba (verified): Training in progress, step 600. lewtun (HF staff) committed on Aug 24
46351da (verified): Training in progress, step 400. lewtun (HF staff) committed on Aug 24
145d72f (verified): Training in progress, step 200. lewtun (HF staff) committed on Aug 24