allenai/tulu-2-dpo-70b

hamishivi committed on Nov 18, 2023

Commit cbe7317 (1 parent: 8d97dab)

Update README.md

Files changed (1)

README.md +1 -1

@@ -17,7 +17,7 @@ base_model: meta-llama/Llama-2-70b-hf
 # Model Card for Tulu V2 DPO 70B
 Tulu is a series of language models that are trained to act as helpful assistants.
-Tulu V2 DPO 70B, and is a fine-tuned version of Llama 2 that was trained on on a mix of publicly available, synthetic and human datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).
+Tulu V2 DPO 70B is a fine-tuned version of Llama 2 that was trained on on a mix of publicly available, synthetic and human datasets using [Direct Preference Optimization (DPO)](https://arxiv.org/abs/2305.18290).
 This model is a strong alternative to Llama 2 70b Chat.