alexwb/reward_modeling_anthropic_hh_rm1e-6

Generated from Trainer

Commit History

End of training

6b77313
verified

alexwb committed on Aug 3

initial commit

d2034e2
verified

alexwb committed on Aug 3