paulo037/StableCode-DPO
Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
dpo
Inference Endpoints
arxiv:2305.18290
StableCode-DPO/adapter_model.safetensors @ 4b584e2
Commit History
Training in progress, step 150
4b584e2 (verified), paulo037 committed about 10 hours ago

Training in progress, step 125
c1b3713 (verified), paulo037 committed about 11 hours ago

Training in progress, step 100
260b067 (verified), paulo037 committed about 11 hours ago

Training in progress, step 75
bbb7f20 (verified), paulo037 committed about 11 hours ago

Training in progress, step 50
694f28b (verified), paulo037 committed about 11 hours ago

Training in progress, step 25
962ca01 (verified), paulo037 committed about 12 hours ago