Update README.md
README.md CHANGED
@@ -16,7 +16,7 @@ pipeline_tag: text-generation
 
 # Llama 3.2 180M Amharic
 
-This is a smaller version of the Meta's [Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) decoder transformer model pretrained from scratch for **
+This is a smaller version of the Meta's [Llama-3.2-1B](https://huggingface.co/meta-llama/Llama-3.2-1B) decoder transformer model pretrained from scratch for **26 hours** using a single **A100 40GB** GPU **274 million tokens** of **Amharic** text.
 
 - It has **180 Million parameters**
 - The **context size** of this model is **1024** tokens.