Update README.md
README.md
CHANGED
@@ -3,11 +3,11 @@ model-index:
 - name: tulu-v2.5-dpo-13b-uf-mean
   results: []
 datasets:
--
+- allenai/tulu-2.5-preference-data
 - allenai/tulu-v2-sft-mixture
 language:
 - en
-base_model:
+base_model: allenai/tulu-2-dpo-13b
 license: apache-2.0
 ---
 
@@ -34,20 +34,9 @@ For more details, read the paper:
 ### Model Sources
 
 - **Repository:** https://github.com/allenai/open-instruct
-- **Dataset:** Data used to train this model can be found
+- **Dataset:** Data used to train this model can be found [here](https://huggingface.co/datasets/allenai/tulu-2.5-preference-data).
 - **Model Family:** The collection of related models can be found [here](https://huggingface.co/collections/allenai/tulu-v25-suite-66676520fd578080e126f618).
 
-## Performance
-
-| Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
-|-------------|-----|----|---------------|--------------|
-| **Tulu-v2-7b** 🐪 | **7B** | **SFT** | **6.30** | **73.9** |
-| **Tulu-v2-dpo-7b** 🐪 | **7B** | **DPO** | **6.29** | **85.1** |
-| **Tulu-v2-13b** 🐪 | **13B** | **SFT** | **6.70** | **78.9** |
-| **Tulu-v2-dpo-13b** 🐪 | **13B** | **DPO** | **7.00** | **89.5** |
-| **Tulu-v2-70b** 🐪 | **70B** | **SFT** | **7.49** | **86.6** |
-| **Tulu-v2-dpo-70b** 🐪 | **70B** | **DPO** | **7.89** | **95.1** |
-
 ## Input Format
 
 The model is trained to use the following format (note the newlines):