hanyinwang/layer-project-reward-model
PEFT
Safetensors
hanyinwang/layer-project-reward-training
English
trl
reward-trainer
Generated from Trainer
License: apache-2.0
main
Commit History
a441430  Update README.md (verified; hanyinwang committed on May 3)
d5a3dcf  Update README.md (verified; hanyinwang committed on May 3)
c79a1aa  Update README.md (verified; hanyinwang committed on May 3)
7acfd95  Upload data_reward_model_training.csv (verified; hanyinwang committed on May 3)
1bce8c1  End of training (verified; hanyinwang committed on May 2)
06f8cba  initial commit (verified; hanyinwang committed on May 2)