Update README.md
README.md
CHANGED
@@ -184,7 +184,7 @@ Please refer to [togethercomputer/RedPajama-Data-1T](https://huggingface.co/data
 - **Optimizer:** Apex FusedAdam
 - **Parallelism:** Pipeline parallel 12, tensor parallel 2
 - **Gradient Accumulations**: 8 (global batch size 4M tokens)
-- **Num of Tokens:**
+- **Num of Tokens:** 1.001T Tokens
 - **Learning rate:** 0.00012
 
 ## Benchmark
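
As a rough sanity check on the value added in this change, the token count and the global batch size listed above imply an approximate number of optimizer steps. The sketch below is illustrative only: the README does not say whether "4M tokens" means 4,000,000 or 4 × 2^20, so both readings are shown as assumptions, and the helper name `optimizer_steps` is hypothetical.

```python
def optimizer_steps(total_tokens: int, tokens_per_step: int) -> int:
    """Approximate optimizer steps needed to consume a token budget."""
    return round(total_tokens / tokens_per_step)


# "1.001T Tokens" as stated in the changed README line.
TOTAL_TOKENS = 1_001_000_000_000

# Assumption: "4M" read as a decimal figure (4,000,000 tokens per step).
TOKENS_PER_STEP_DECIMAL = 4_000_000

# Assumption: "4M" read as a power-of-two figure (4 * 2**20 tokens per step).
TOKENS_PER_STEP_BINARY = 4 * 2**20

steps_decimal = optimizer_steps(TOTAL_TOKENS, TOKENS_PER_STEP_DECIMAL)
steps_binary = optimizer_steps(TOTAL_TOKENS, TOKENS_PER_STEP_BINARY)

print(f"steps (4M = 4,000,000): {steps_decimal}")
print(f"steps (4M = 4 * 2**20): {steps_binary}")
```

Either reading puts the run at roughly a quarter of a million optimizer steps; the actual step count is not stated in the README.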