Update README.md
Browse files
README.md
CHANGED
@@ -45,8 +45,9 @@ language:
|
|
45 |
|
46 |
# Salamandra Model Card
|
47 |
|
48 |
-
Salamandra
|
49 |
-
|
|
|
50 |
|
51 |
To visit the model cards of other Salamandra versions, please refer to the [Model Index](#model-index).
|
52 |
|
@@ -65,7 +66,7 @@ Along with the open weights, all training scripts and configuration files are ma
|
|
65 |
|
66 |
### Description
|
67 |
|
68 |
-
Transformer-based decoder-only language model that has been pre-trained on 7.8 trillion tokens of highly curated data.
|
69 |
The pre-training corpus contains text in 35 European languages and code.
|
70 |
|
71 |
### Hyperparameters
|
|
|
45 |
|
46 |
# Salamandra Model Card
|
47 |
|
48 |
+
Salamandra is a highly multilingual model pre-trained from scratch that comes in three different
|
49 |
+
sizes — 2B, 7B and 40B parameters — with their respective base and instruction-tuned variants.
|
50 |
+
This model card corresponds to the 7B instructed version.
|
51 |
|
52 |
To visit the model cards of other Salamandra versions, please refer to the [Model Index](#model-index).
|
53 |
|
|
|
66 |
|
67 |
### Description
|
68 |
|
69 |
+
Transformer-based decoder-only language model that has been pre-trained from scratch on 7.8 trillion tokens of highly curated data.
|
70 |
The pre-training corpus contains text in 35 European languages and code.
|
71 |
|
72 |
### Hyperparameters
|