mattshumer
committed on
Commit 50089e2
1 Parent(s): 5a67bfe
Update README.md
README.md
CHANGED
@@ -3,6 +3,6 @@ datasets:
 - Yukang/LongAlpaca-16k-length
 ---
 
-This is an extended (16K) context version of LLaMA 3. Trained for five hours on 8x A6000 GPUs, using the `Yukang/LongAlpaca-16k-length` dataset.
+This is an extended (16K) context version of LLaMA 3 8B (base, not instruct). Trained for five hours on 8x A6000 GPUs, using the `Yukang/LongAlpaca-16k-length` dataset.
 
 `rope_theta` was set to `1000000.0`. Trained with Axolotl.
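
For readers wondering what the `rope_theta` value in this README affects, here is a minimal sketch of how the RoPE base sets the rotation wavelengths. It is not taken from this repository or from Axolotl. The function name `rope_wavelengths`, the head dimension of 128, and the comparison value of `500000.0` (assumed here to be the stock LLaMA 3 setting) are illustrative assumptions.

```python
import math


def rope_wavelengths(head_dim: int, rope_theta: float) -> list[float]:
    """Return the wavelength, in tokens, of each RoPE frequency pair.

    Standard RoPE uses inv_freq_i = rope_theta ** (-2i / head_dim)
    for i = 0 .. head_dim/2 - 1; the wavelength is 2*pi / inv_freq_i.
    """
    inv_freqs = [rope_theta ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]
    return [2 * math.pi / f for f in inv_freqs]


HEAD_DIM = 128          # assumption: per-head dimension of an 8B LLaMA-style model
CONTEXT_LEN = 16 * 1024  # the 16K context mentioned in the README

# 500000.0 is an assumed baseline; 1000000.0 is the value stated in the README.
for theta in (500_000.0, 1_000_000.0):
    waves = rope_wavelengths(HEAD_DIM, theta)
    beyond = sum(w > CONTEXT_LEN for w in waves)
    print(f"rope_theta={theta:.0f}: longest wavelength {max(waves):.0f} tokens, "
          f"{beyond}/{len(waves)} pairs never complete a rotation within {CONTEXT_LEN} tokens")
```

A larger base stretches the low-frequency wavelengths, which is the usual motivation for raising `rope_theta` when extending context length.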