Commit 66afa12 (1 parent: 640eb46), committed by mattjordan
Update README.md
README.md
CHANGED
@@ -22,7 +22,7 @@ These models are trained on the Dolma dataset. We are releasing all code, checkp
 | Size | Training Tokens | Layers | Hidden Size | Attention Heads | Context Length |
 |------|--------|---------|-------------|-----------------|----------------|
 | [OLMo 2-7B](https://huggingface.co/allenai/OLMo-2-1124-7B) | 4 Trillion | 32 | 4096 | 32 | 4096 |
-| [OLMo 2-13B](https://huggingface.co/allenai/OLMo-2-1124-13B) | 5 Trillion | 40 | 5120 |
+| [OLMo 2-13B](https://huggingface.co/allenai/OLMo-2-1124-13B) | 5 Trillion | 40 | 5120 | 40 | 4096 |
 
 The core models released in this batch include the following:
 