Hugofernandez committed on
Commit
d747762
1 Parent(s): 12a9acf

Create README.md

Files changed (1)
  1. README.md +52 -0
README.md ADDED
@@ -0,0 +1,52 @@
+ ---
+ license: apache-2.0
+ pipeline_tag: text-generation
+ language:
+ - en
+ tags:
+ - pretrained
+ inference:
+   parameters:
+     temperature: 0.7
+ ---
+
+ # Model Card for Mistral-7B-Instruct-v0.1-colab-sharded
+
+ This model is a re-sharded version of Mistral-7B-Instruct-v0.1 with more shards (6 instead of 2), which makes it easier to load on machines with limited RAM, such as the free version of Google Colab.
+
+ The original model can be found here: https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.1
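As a rough illustration of why more shards help on a low-RAM machine, the sketch below estimates the size of a single shard. It assumes 16-bit weights (2 bytes per parameter) and equally sized shards, neither of which is stated in this README, and it uses the rounded "7 billion parameters" figure from the Model Description. `approx_shard_size_gb` is a hypothetical helper, not part of any library.

```python
def approx_shard_size_gb(num_params: float, num_shards: int, bytes_per_param: int = 2) -> float:
    """Rough size of one checkpoint shard in GB (1 GB = 1e9 bytes).

    Assumes equally sized shards and, by default, 16-bit weights.
    Real shards vary in size, so treat the result as an estimate only.
    """
    if num_shards < 1:
        raise ValueError("num_shards must be at least 1")
    return num_params * bytes_per_param / num_shards / 1e9


PARAMS = 7e9  # rounded parameter count quoted in the Model Description

two_shards = approx_shard_size_gb(PARAMS, 2)  # original layout: 2 shards
six_shards = approx_shard_size_gb(PARAMS, 6)  # this repository: 6 shards

print(f"2 shards: ~{two_shards:.1f} GB each")
print(f"6 shards: ~{six_shards:.1f} GB each")
```

Under these assumptions each shard shrinks from roughly 7 GB to a little over 2 GB. Sharded checkpoints are generally read one shard at a time, so a smaller shard should mean a smaller transient memory footprint while loading.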
+
+ ## Model Description
+
+ The Mistral-7B-v0.1 Large Language Model (LLM) is a pretrained generative text model with 7 billion parameters.
+ Mistral-7B-v0.1 outperforms Llama 2 13B on all benchmarks we tested.
+
+ For full details of this model please read the [paper](https://arxiv.org/abs/2310.06825) and [release blog post](https://mistral.ai/news/announcing-mistral-7b/).
+
+ ## Model Architecture
+
+ Mistral-7B-v0.1 is a transformer model, with the following architecture choices:
+ - Grouped-Query Attention
+ - Sliding-Window Attention
+ - Byte-fallback BPE tokenizer
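To make the sliding-window idea concrete, here is a minimal pure-Python sketch of a causal attention mask restricted to a fixed window. It only illustrates the concept and is not the Mistral implementation. The tiny window size, the boundary convention (each position sees itself plus the previous `window - 1` positions) and the helper name `sliding_window_causal_mask` are all assumptions made for this example.

```python
def sliding_window_causal_mask(seq_len: int, window: int) -> list[list[bool]]:
    """Return mask[i][j] == True when query position i may attend to key position j.

    Convention assumed here: position i sees itself and the previous
    window - 1 positions, and never any future position.
    """
    if window < 1:
        raise ValueError("window must be at least 1")
    return [
        [(j <= i) and (i - j < window) for j in range(seq_len)]
        for i in range(seq_len)
    ]


mask = sliding_window_causal_mask(seq_len=5, window=3)
for row in mask:
    # "x" marks an allowed (query, key) pair, "." a masked one.
    print("".join("x" if allowed else "." for allowed in row))
```

With a plain causal mask the last row would be fully populated. Here it keeps only the three most recent positions, which bounds the attention cost per token by the window size rather than by the sequence length.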
+
+ ## Troubleshooting
+
+ - If you see the following error:
+ ```
+ KeyError: 'mistral'
+ ```
+ - Or:
+ ```
+ NotImplementedError: Cannot copy out of meta tensor; no data!
+ ```
+
+ Ensure you are using a stable release of Transformers, version 4.34.0 or newer.
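One way to check the installed version before loading the model is sketched below, using only the standard library. The helper names (`parse_release`, `meets_minimum`, `installed_transformers_ok`) are hypothetical. The parsing is deliberately simple: it ignores pre-release suffixes, so a release candidate such as `4.34.0rc1` would pass even though it is not a stable release.

```python
from importlib import metadata

MIN_TRANSFORMERS = (4, 34, 0)  # minimum version named in the troubleshooting note


def parse_release(version: str) -> tuple[int, ...]:
    """Turn '4.34.0' or '4.35.0.dev0' into a tuple of its leading integers."""
    parts = []
    for piece in version.split("."):
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        if not digits:
            break  # stop at the first non-numeric component, e.g. 'dev0'
        parts.append(int(digits))
    return tuple(parts)


def meets_minimum(version: str, minimum: tuple[int, ...] = MIN_TRANSFORMERS) -> bool:
    """Compare a version string against the minimum, padding short versions with zeros."""
    release = parse_release(version)
    padded = release + (0,) * (len(minimum) - len(release))
    return padded >= minimum


def installed_transformers_ok() -> bool:
    """Return False if transformers is missing or older than the minimum."""
    try:
        return meets_minimum(metadata.version("transformers"))
    except metadata.PackageNotFoundError:
        return False


print("transformers >= 4.34.0:", installed_transformers_ok())
```

If this prints `False`, upgrading with `pip install -U transformers` is the usual remedy.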
+
+ ## Notice
+
+ Mistral 7B is a pretrained base model and therefore does not have any moderation mechanisms.
+
+ ## The Mistral AI Team
+
+ Albert Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lélio Renard Lavaud, Lucile Saulnier, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, William El Sayed.