beomi
/

open-llama-2-ko-7b

Text Generation

text-generation-inference

Model card Files Files and versions Community

beomi commited on Dec 14, 2023

Commit

5a0fcef

•

1 Parent(s): a433f27

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -54,6 +54,8 @@ Open-Llama-2-Ko is an auto-regressive language model that uses an optimized tran
 Trained with selected corpus within AIHub/Modu Corpus. The detailed dataset list to train this model is available below:
 - AI Hub: [corpus/AI_HUB](./corpus/AI_HUB)
 - Modu Corpus: [corpus/MODU_CORPUS](./corpus/MODU_CORPUS)
 Final JSONL dataset to trian this model is: 61GB.

 Trained with selected corpus within AIHub/Modu Corpus. The detailed dataset list to train this model is available below:
 - AI Hub: [corpus/AI_HUB](./corpus/AI_HUB)
+  - Used only `Training` part of the data.
+  - Explicitly dropped `Validation`/`Test` part of the data.
 - Modu Corpus: [corpus/MODU_CORPUS](./corpus/MODU_CORPUS)
 Final JSONL dataset to trian this model is: 61GB.