qgallouedec/reward_modeling_anthropic_hh
Safetensors
opt
trl
reward-trainer
Generated from Trainer
License: other
main
Commit History
End of training
bff909d
verified
qgallouedec
HF staff
committed on
Aug 18
End of training
c027a09
verified
qgallouedec
HF staff
committed on
Aug 17
initial commit
cac74ad
verified
qgallouedec
HF staff
committed on
Aug 17