Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
NicholasCorrado
/
zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e
like
0
Text Generation
Transformers
Safetensors
data/zephyr_uf_rlced_conifer_ref
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
5c972e6
zephyr-7b-uf-rlced-conifer-group-dpo-2e-alr-0.01-1e
/
model-00003-of-00003.safetensors
Commit History
Training in progress, step 720
5c972e6
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 540
ffbcdf1
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 360
6fa27cd
verified
NicholasCorrado
commited on
Sep 11
Training in progress, step 180
0b86a66
verified
NicholasCorrado
commited on
Sep 11