allenai/tulu-v2.5-dpo-13b-uf-mean

hamishivi committed on Jun 10

Commit 7f745ae • 1 Parent(s): 75da40b

Update README.md

Files changed (1)

README.md +3 -14

@@ -3,11 +3,11 @@ model-index:
 - name: tulu-v2.5-dpo-13b-uf-mean
   results: []
 datasets:
-- HuggingFaceH4/ultrafeedback_binarized
+- allenai/tulu-2.5-preference-data
 - allenai/tulu-v2-sft-mixture
 language:
 - en
-base_model: meta-llama/Llama-2-13b-hf
+base_model: allenai/tulu-2-dpo-13b
 license: apache-2.0
 ---
@@ -34,20 +34,9 @@ For more details, read the paper:
 ### Model Sources
 - **Repository:** https://github.com/allenai/open-instruct
-- **Dataset:** Data used to train this model can be found at **TODO UPLOAD DATA**
+- **Dataset:** Data used to train this model can be found [here](https://huggingface.co/datasets/allenai/tulu-2.5-preference-data).
 - **Model Family:** The collection of related models can be found [here](https://huggingface.co/collections/allenai/tulu-v25-suite-66676520fd578080e126f618).
-## Performance
-| Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
-|-------------|-----|----|---------------|--------------|
-| **Tulu-v2-7b** 🐪 | **7B** | **SFT** | **6.30** | **73.9** |
-| **Tulu-v2-dpo-7b** 🐪 | **7B** | **DPO** | **6.29** | **85.1** |
-| **Tulu-v2-13b** 🐪 | **13B** | **SFT** | **6.70** | **78.9** |
-| **Tulu-v2-dpo-13b** 🐪 | **13B** | **DPO** | **7.00** | **89.5** |
-| **Tulu-v2-70b** 🐪 | **70B** | **SFT** | **7.49** | **86.6** |
-| **Tulu-v2-dpo-70b** 🐪 | **70B** | **DPO** | **7.89** | **95.1** |
 ## Input Format
 The model is trained to use the following format (note the newlines):