Update README.md
Browse files
README.md
CHANGED
@@ -9,4 +9,6 @@ pipeline_tag: text-generation
|
|
9 |
|
10 |
A language model trained on Russian. May be suitable for further tuning. The 100 gigabyte dataset consisted primarily of web pages, books, poems, and prose. The model was trained over 2 epochs.
|
11 |
|
12 |
-
Uses GPT-J architecture with a context window of 4k tokens.
|
|
|
|
|
|
9 |
|
10 |
A language model trained on Russian. May be suitable for further tuning. The 100 gigabyte dataset consisted primarily of web pages, books, poems, and prose. The model was trained over 2 epochs.
|
11 |
|
12 |
+
Uses GPT-J architecture with a context window of 4k tokens.
|
13 |
+
|
14 |
+
Trained thanks to a TRC grant on TPU-VM v3-8
|