Update README.md
Browse files
README.md
CHANGED
@@ -10,4 +10,10 @@ tags:
|
|
10 |
- text-generation-inference
|
11 |
- CultriX/NeuralTrix-7B-dpo
|
12 |
- dpo
|
13 |
-
---
|
|
|
|
|
|
|
|
|
|
|
|
|
|
10 |
- text-generation-inference
|
11 |
- CultriX/NeuralTrix-7B-dpo
|
12 |
- dpo
|
13 |
+
---
|
14 |
+
|
15 |
+
DPO Finetuned CultriX/NeuralTrix-7B-dpo using argilla/OpenHermes2.5-dpo-binarized-alpha
|
16 |
+
|
17 |
+
argilla dpo binarized pairs is a dataset built on top of: https://huggingface.co/datasets/teknium/OpenHermes-2.5 using https://github.com/argilla-io/distilabel if interested.
|
18 |
+
|
19 |
+
Thx for the great data sources.
|