license: other datasets: - mlabonne/orpo-dpo-mix-40k tags: - dpo
This is a DPO fine-tune of Daredevil-8-abliterated trained on one epoch of orpo-dpo-mix-40k.
TBD.