This model is a fine-tuned version of meta-llama/Meta-Llama-3-8B-Instruct on the princeton-nlp/llama3-ultrafeedback dataset.
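As a minimal sketch of how the two artifacts named above could be fetched from the Hugging Face Hub, assuming the `datasets` and `transformers` libraries are installed: the helper names `load_preference_data` and `load_base_model` are hypothetical, and the fine-tuned checkpoint itself is not loaded because its repository id is not stated here.

```python
# Identifiers taken from the model card text.
BASE_MODEL_ID = "meta-llama/Meta-Llama-3-8B-Instruct"
DATASET_ID = "princeton-nlp/llama3-ultrafeedback"


def load_preference_data():
    """Download the dataset named in the card (hypothetical helper)."""
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(DATASET_ID)


def load_base_model():
    """Load the base tokenizer and model (hypothetical helper).

    The base repository is gated, so this assumes access has been granted
    and that a Hugging Face token is configured locally.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(BASE_MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(BASE_MODEL_ID)
    return tokenizer, model
```

Calling `load_preference_data()` needs network access, and `load_base_model()` downloads the full 8B-parameter weights, so both are best run on a machine with enough disk space and memory.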
More information needed
The following hyperparameters were used during training:
Base model: meta-llama/Meta-Llama-3-8B-Instruct