Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
zkshan2002
/
DPO-uf-llama3-8B-OpenRLHF
like
0
Safetensors
HuggingFaceH4/ultrafeedback_binarized
llama
Model card
Files
Files and versions
Community
Train
Edit model card
README.md exists but content is empty. Use the
Edit model card
button to edit it.
Downloads last month
840
Safetensors
Model size
8.03B params
Tensor type
BF16
·
Inference API
Unable to determine this model's library. Check the
docs
.
Model tree for
zkshan2002/DPO-uf-llama3-8B-OpenRLHF
Base model
OpenRLHF/Llama-3-8b-sft-mixture
Finetuned
(
3
)
this model
Quantizations
1 model
Dataset used to train
zkshan2002/DPO-uf-llama3-8B-OpenRLHF
HuggingFaceH4/ultrafeedback_binarized
Viewer
•
Updated
Oct 16
•
187k
•
6.29k
•
244