Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,7 @@ base_model: meta-llama/Llama-2-70b-hf
|
|
17 |
# Model Card for Tulu V2 DPO 70B
|
18 |
|
19 |
Tulu is a series of language models that are trained to act as helpful assistants.
|
20 |
-
Tulu V2 DPO 70B
|
21 |
This model is a strong alternative to Llama 2 70b Chat.
|
22 |
|
23 |
|
|
|
17 |
# Model Card for Tulu V2 DPO 70B
|
18 |
|
19 |
Tulu is a series of language models that are trained to act as helpful assistants.
|
20 |
+
Tulu V2 DPO 70B is a fine-tuned version of Llama 2 that was trained on on a mix of publicly available, synthetic and human datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).
|
21 |
This model is a strong alternative to Llama 2 70b Chat.
|
22 |
|
23 |
|