robinsmits/Qwen1.5-7B-Dutch-Chat-Dpo
Text Generation
PEFT
TensorBoard
Safetensors
BramVanroy/ultra_feedback_dutch_cleaned
Dutch
trl
dpo
conversational
Generated from Trainer
qwen2
arxiv: 2309.16609
License: cc-by-nc-4.0
Qwen1.5-7B-Dutch-Chat-Dpo/adapter_model.safetensors (main)
Commit History
End of training (b7e48de, verified), robinsmits committed on Mar 29
Training in progress, step 300 (1857007, verified), robinsmits committed on Mar 29
Training in progress, step 270 (165f205, verified), robinsmits committed on Mar 29
Training in progress, step 240 (a20a562, verified), robinsmits committed on Mar 29
Training in progress, step 210 (01c8180, verified), robinsmits committed on Mar 29
Training in progress, step 180 (cff81df, verified), robinsmits committed on Mar 29
Training in progress, step 150 (b2451e8, verified), robinsmits committed on Mar 29
Training in progress, step 120 (01abb93, verified), robinsmits committed on Mar 29
Training in progress, step 90 (77c2265, verified), robinsmits committed on Mar 29
Training in progress, step 60 (d2297e8, verified), robinsmits committed on Mar 29
Training in progress, step 30 (50a6639, verified), robinsmits committed on Mar 29