sam-mosaic committed
Commit: c45ca0a
Parent: abe8dd5
Update README.md

README.md CHANGED
@@ -21,6 +21,7 @@ inference: false
 
 MPT-7B-Instruct-8k is a model for long-form instruction following, especially question-answering on and summarization of longer documents.
 It is built by finetuning [MPT-7B-8k](https://huggingface.co/mosaicml/mpt-7b-8k) on [Dolly HHRLHF](https://huggingface.co/datasets/mosaicml/dolly_hhrlhf) derived from the [Databricks Dolly-15k](https://huggingface.co/datasets/databricks/databricks-dolly-15k) and the [Anthropic Helpful and Harmless (HH-RLHF)](https://huggingface.co/datasets/Anthropic/hh-rlhf) datasets. It is also trained on [Competition Math](https://huggingface.co/datasets/competition_math), [Duorc](https://huggingface.co/datasets/duorc), [CoT GSM8k](https://huggingface.co/datasets/conceptofmind/cot_submix_original), [Qasper](https://huggingface.co/datasets/allenai/qasper), [Quality](https://huggingface.co/datasets/emozilla/quality), [Summ Screen FD](https://huggingface.co/datasets/tau/scrolls) and [Spider](https://huggingface.co/datasets/spider).
+This is the same dataset that [MPT-30B-Instruct](https://huggingface.co/mosaicml/mpt-30b-instruct) was trained on.
 * License: _CC-By-SA-3.0_
 * [Demo on Hugging Face Spaces](https://huggingface.co/spaces/mosaicml/mpt-7b-instruct-8k)
 
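For readers of the updated card, a minimal, hypothetical usage sketch (not part of this commit) showing how such an MPT checkpoint is typically loaded with Hugging Face Transformers; the repo id is taken from the demo link above, and the prompt template is assumed to follow the dolly_hhrlhf instruction format rather than quoted from the card:

```python
# Minimal sketch, assuming the standard Transformers auto classes.
# MPT checkpoints ship custom modeling code, so trust_remote_code=True is needed.
import transformers

name = "mosaicml/mpt-7b-instruct-8k"  # assumed repo id, matching the model name above
tokenizer = transformers.AutoTokenizer.from_pretrained(name)
model = transformers.AutoModelForCausalLM.from_pretrained(name, trust_remote_code=True)

# Assumed dolly_hhrlhf-style prompt; check the full card for the exact template.
prompt = (
    "Below is an instruction that describes a task. "
    "Write a response that appropriately completes the request.\n"
    "### Instruction:\nSummarize the plot of Dracula in two sentences.\n"
    "### Response:\n"
)
inputs = tokenizer(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=100)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```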
@@ -143,18 +144,17 @@ The model has been modified from a standard transformer in the following ways:
 
 The model was trained on the following data mix:
 
-| Data Source | Number of Tokens in Source | Proportion |
+| Data Source | Number of Tokens in Source | Proportion |
 |-------------|----------------------------|------------|
-
-
-
-
-
-
-
-
-
-"LongConversations" is a GPT3.5/4-generated dataset, details of which will be released at a later date.
+| competition_math | 1.6 M | 3.66% |
+| cot_gsm8k | 3.36 M | 7.67% |
+| dialogsum | 0.1 M | 0.23% |
+| dolly_hhrlhf | 5.89 M | 13.43% |
+| duorc | 7.8 M | 17.80% |
+| qasper | 8.72 M | 19.90% |
+| quality | 11.29 M | 25.78% |
+| scrolls/summ_screen_fd | 4.97 M | 11.33% |
+| spider | 0.089 M | 0.20% |
 
 ### Training Configuration
 
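As a quick arithmetic check on the new mix (not part of the commit), each proportion is a source's token count divided by the roughly 43.8 M token total; the rounded counts above reproduce the listed percentages to within a few hundredths of a point:

```python
# Sanity check of the data-mix table: proportion = tokens in source / total tokens.
# Token counts (in millions) are copied from the added table rows.
mix_m_tokens = {
    "competition_math": 1.6,
    "cot_gsm8k": 3.36,
    "dialogsum": 0.1,
    "dolly_hhrlhf": 5.89,
    "duorc": 7.8,
    "qasper": 8.72,
    "quality": 11.29,
    "scrolls/summ_screen_fd": 4.97,
    "spider": 0.089,
}
total = sum(mix_m_tokens.values())  # about 43.8 M tokens
for source, tokens in mix_m_tokens.items():
    print(f"{source:24s} {tokens / total:6.2%}")
```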