Update README.md
README.md
CHANGED
@@ -19,7 +19,8 @@ We utilize ONLY the prompts from [UltraFeedback](https://huggingface.co/datasets
 This overview provides a high-level summary of our approach.
 We plan to release more detailed results and findings in the coming weeks on the [Snorkel blog](https://snorkel.ai/blog/).
 
-
+### Training recipe:
+- This data is formatted to be compatible with the Hugging Face's [Zephyr recipe](https://github.com/huggingface/alignment-handbook/tree/main/recipes/zephyr-7b-beta).
 We executed the n_th DPO iteration using the "train/test_iteration_{n}".
 
 ### Key Premises: