Commit f4c8132 by yinsong1986
1 Parent(s): 4ececff

Update README.md

README.md CHANGED
@@ -13,6 +13,10 @@ MistralLite is similar to [Mistral-7B-Instruct-v0.1](https://huggingface.co/mist
 | Mistral-7B-Instruct-v0.1 | up to 8K tokens | 32K | rope_theta = 10000 | 4096 |
 | MistralLite | up to 16K tokens | 32K | **rope_theta = 1000000** | **16384** |

+**Important - Use the prompt template below for MistralLite:**
+
+```<|prompter|>What are the main challenges to support a long context for LLM?</s><|assistant|>```
+
 ## Motivation of Developing MistralLite

 Since the release of [Mistral-7B-Instruct-v0.1](https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1), the model became increasingly popular because its strong performance
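
The prompt template added in this commit wraps the user's text between `<|prompter|>` and `</s><|assistant|>`. A minimal sketch of building such a prompt in Python follows; the helper name `build_prompt` is hypothetical and does not come from the README, and the sketch covers only a single-turn prompt as shown in the diff.

```python
# Hypothetical helper (not part of the README): formats a single-turn
# prompt using the template shown in the added README lines.
def build_prompt(question: str) -> str:
    """Wrap a user question in the MistralLite prompt template."""
    return f"<|prompter|>{question}</s><|assistant|>"


# The example question is the one used in the README diff.
prompt = build_prompt(
    "What are the main challenges to support a long context for LLM?"
)
print(prompt)
```

The resulting string would then be passed as the input text to whatever inference stack serves the model; that step is left out here because the diff does not show it.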