EleutherAI/polyglot-ko-3.8b

hyunwoongko committed on Oct 4, 2022

Commit 94bc87d (1 parent: 451a670)

Update README.md

Files changed (1)

README.md +1 -1

@@ -31,7 +31,7 @@ dimensions of each head. The model is trained with a tokenization vocabulary of
 ## Training data
-Polyglot-Ko was trained on 863 GB of Korean language data (1.2TB before processing), a large-scale dataset curated by [TUNiB](https://tunib.ai/). The data collection process has abided by South Korean laws. This dataset was collected for the purpose of training Polyglot-Ko models, so it will not be released for public use.
+Polyglot-Ko-3.8B was trained on 863 GB of Korean language data (1.2TB before processing), a large-scale dataset curated by [TUNiB](https://tunib.ai/). The data collection process has abided by South Korean laws. This dataset was collected for the purpose of training Polyglot-Ko models, so it will not be released for public use.
 | Source                              |Size (GB) | Link                                  |
 |-------------------------------------|---------|------------------------------------------|