Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
NicholasCorrado
/
zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e
like
0
Text Generation
Transformers
Safetensors
data/zephyr_uf_rlced_conifer_ref
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e
Commit History
End of training
30a9f05
verified
NicholasCorrado
commited on
Sep 11
Model save
8b5f5d2
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 720
5c972e6
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 540
ffbcdf1
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 360
6fa27cd
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 180
0b86a66
verified
NicholasCorrado
commited on
Sep 11
initial commit
4abddbb
verified
NicholasCorrado
commited on
Sep 11