Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
nlee282
/
moai-dpo-1.0
like
0
PEFT
TensorBoard
Safetensors
unalignment/toxic-dpo-v0.1
trl
dpo
Generated from Trainer
Model card
Files
Files and versions
Metrics
Training metrics
Community
Use this model
nlee282
commited on
Jan 5
Commit
0b46e68
•
1 Parent(s):
866158c
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+2
-0
README.md
CHANGED
Viewed
@@ -1,4 +1,6 @@
1
---
2
library_name: peft
3
tags:
4
- trl
1
---
2
+
datasets:
3
+
- unalignment/toxic-dpo-v0.1
4
library_name: peft
5
tags:
6
- trl