nbeerbower's Collections
DPO
Updated 13 days ago
Various useful datasets for preference optimization
jondurbin/gutenberg-dpo-v0.1 • Updated Jan 12, 2024 • 918 • 1.24k • 131
nbeerbower/gutenberg2-dpo • Updated Nov 16, 2024 • 293 • 106 • 19
jondurbin/truthy-dpo-v0.1 • Updated Jan 11, 2024 • 1.02k • 287 • 132
kyujinpy/orca_math_dpo • Updated Apr 12, 2024 • 15.3k • 72 • 18
antiven0m/physical-reasoning-dpo • Updated Mar 23, 2024 • 899 • 100 • 10
flammenai/MahouMix-v1 • Updated May 30, 2024 • 267 • 38 • 4
flammenai/Date-DPO-NoAsterisks • Updated Sep 18, 2024 • 330 • 52 • 4
nbeerbower/Arkhaios-DPO • Updated Nov 12, 2024 • 222 • 148 • 8
nbeerbower/Purpura-DPO • Updated Nov 12, 2024 • 230 • 109 • 7
nbeerbower/Schule-DPO • Updated Nov 16, 2024 • 34 • 97 • 1
HumanLLMs/Human-Like-DPO-Dataset • Updated 23 days ago • 10.9k • 2.92k • 188
nbeerbower/gutenberg-moderne-dpo • Updated Nov 17, 2024 • 346 • 122 • 2
nbeerbower/reddit-dpo • Updated 4 days ago • 76.9k • 173 • 1
Atsunori/HelpSteer2-DPO • Updated Jul 11, 2024 • 7.59k • 107 • 6
abacusai/MetaMath_DPO_FewShot • Updated Feb 26, 2024 • 395k • 136 • 25
nbeerbower/GreatFirewall-DPO • Updated 13 days ago • 492 • 131 • 4