paulo037/StableCode-DPO
Tags: Transformers, TensorBoard, Safetensors, Generated from Trainer, trl, dpo, Inference Endpoints, arxiv:2305.18290
Commit History
Training in progress, step 150 (4b584e2, verified), committed by paulo037 about 9 hours ago
Training in progress, step 125, checkpoint (d33638d, verified), committed by paulo037 about 9 hours ago
Training in progress, step 125 (c1b3713, verified), committed by paulo037 about 9 hours ago
Training in progress, step 100, checkpoint (8f878e3, verified), committed by paulo037 about 9 hours ago
Training in progress, step 100 (260b067, verified), committed by paulo037 about 9 hours ago
Training in progress, step 75, checkpoint (de4c2b5, verified), committed by paulo037 about 10 hours ago
Training in progress, step 75 (bbb7f20, verified), committed by paulo037 about 10 hours ago
Training in progress, step 50, checkpoint (a9d283d, verified), committed by paulo037 about 10 hours ago
Training in progress, step 50 (694f28b, verified), committed by paulo037 about 10 hours ago
Training in progress, step 25, checkpoint (8ba1b95, verified), committed by paulo037 about 10 hours ago
Training in progress, step 25 (962ca01, verified), committed by paulo037 about 10 hours ago
initial commit (aa25788, verified), committed by paulo037 about 10 hours ago