Text Generation
Transformers
Safetensors
English
llama
conversational
text-generation-inference
Inference Endpoints
hamishivi commited on
Commit
7f745ae
β€’
1 Parent(s): 75da40b

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -14
README.md CHANGED
@@ -3,11 +3,11 @@ model-index:
3
  - name: tulu-v2.5-dpo-13b-uf-mean
4
  results: []
5
  datasets:
6
- - HuggingFaceH4/ultrafeedback_binarized
7
  - allenai/tulu-v2-sft-mixture
8
  language:
9
  - en
10
- base_model: meta-llama/Llama-2-13b-hf
11
  license: apache-2.0
12
  ---
13
 
@@ -34,20 +34,9 @@ For more details, read the paper:
34
  ### Model Sources
35
 
36
  - **Repository:** https://github.com/allenai/open-instruct
37
- - **Dataset:** Data used to train this model can be found at **TODO UPLOAD DATA**
38
  - **Model Family:** The collection of related models can be found [here](https://huggingface.co/collections/allenai/tulu-v25-suite-66676520fd578080e126f618).
39
 
40
- ## Performance
41
-
42
- | Model | Size | Alignment | MT-Bench (score) | AlpacaEval (win rate %) |
43
- |-------------|-----|----|---------------|--------------|
44
- | **Tulu-v2-7b** πŸͺ | **7B** | **SFT** | **6.30** | **73.9** |
45
- | **Tulu-v2-dpo-7b** πŸͺ | **7B** | **DPO** | **6.29** | **85.1** |
46
- | **Tulu-v2-13b** πŸͺ | **13B** | **SFT** | **6.70** | **78.9** |
47
- | **Tulu-v2-dpo-13b** πŸͺ | **13B** | **DPO** | **7.00** | **89.5** |
48
- | **Tulu-v2-70b** πŸͺ | **70B** | **SFT** | **7.49** | **86.6** |
49
- | **Tulu-v2-dpo-70b** πŸͺ | **70B** | **DPO** | **7.89** | **95.1** |
50
-
51
  ## Input Format
52
 
53
  The model is trained to use the following format (note the newlines):
 
3
  - name: tulu-v2.5-dpo-13b-uf-mean
4
  results: []
5
  datasets:
6
+ - allenai/tulu-2.5-preference-data
7
  - allenai/tulu-v2-sft-mixture
8
  language:
9
  - en
10
+ base_model: allenai/tulu-2-dpo-13b
11
  license: apache-2.0
12
  ---
13
 
 
34
  ### Model Sources
35
 
36
  - **Repository:** https://github.com/allenai/open-instruct
37
+ - **Dataset:** Data used to train this model can be found [here](https://huggingface.co/datasets/allenai/tulu-2.5-preference-data).
38
  - **Model Family:** The collection of related models can be found [here](https://huggingface.co/collections/allenai/tulu-v25-suite-66676520fd578080e126f618).
39
 
 
 
 
 
 
 
 
 
 
 
 
40
  ## Input Format
41
 
42
  The model is trained to use the following format (note the newlines):