Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
lewtun
/
qwen2-1.5B-lr-3e-6
like
0
Safetensors
qwen2
trl
online-dpo
Generated from Trainer
Model card
Files
Files and versions
Community
main
qwen2-1.5B-lr-3e-6
Commit History
Model save
2c6c9bc
verified
lewtun
HF staff
commited on
Aug 25
Training in progress, step 977
742696d
verified
lewtun
HF staff
commited on
Aug 25
Training in progress, step 800
3465061
verified
lewtun
HF staff
commited on
Aug 25
Training in progress, step 600
175f704
verified
lewtun
HF staff
commited on
Aug 24
Training in progress, step 400
eebcae2
verified
lewtun
HF staff
commited on
Aug 24
Training in progress, step 200
c4a64a5
verified
lewtun
HF staff
commited on
Aug 24
initial commit
32a1f33
verified
lewtun
HF staff
commited on
Aug 23