Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
terry69
/
preference_p0.2_seed42_level2_raremixbatch16
like
0
Text Generation
Transformers
TensorBoard
Safetensors
preference-data
mistral
alignment-handbook
trl
sft
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
edfd039
preference_p0.2_seed42_level2_raremixbatch16
/
eval_results.json
Commit History
End of training
949c984
verified
terry69
commited on
Sep 15
End of training
8227740
verified
terry69
commited on
Sep 15
End of training
80a14c7
verified
terry69
commited on
Sep 12