Update README.md
Browse files
README.md
CHANGED
@@ -6,4 +6,8 @@ language:
|
|
6 |
- en
|
7 |
---
|
8 |
|
9 |
-
This is a Llama 2 architecture model series trained on the TinyStories dataset, intended for use in the [llama2.c](https://github.com/karpathy/llama2.c) project by Andrej Karpathy
|
|
|
|
|
|
|
|
|
|
6 |
- en
|
7 |
---
|
8 |
|
9 |
+
This is a Llama 2 architecture model series trained on the TinyStories dataset, intended for use in the [llama2.c](https://github.com/karpathy/llama2.c) project by Andrej Karpathy.
|
10 |
+
|
11 |
+
trained on a single V100 32GB gpu for 3 epochs, we achieve an inference speed of ~72 tokens/sec.
|
12 |
+
|
13 |
+
learn more on how to run inference in pure C using [llama2.c](https://github.com/karpathy/llama2.c)
|