Update README.md
Browse files
README.md
CHANGED
@@ -54,6 +54,8 @@ Open-Llama-2-Ko is an auto-regressive language model that uses an optimized tran
|
|
54 |
Trained with selected corpus within AIHub/Modu Corpus. The detailed dataset list to train this model is available below:
|
55 |
|
56 |
- AI Hub: [corpus/AI_HUB](./corpus/AI_HUB)
|
|
|
|
|
57 |
- Modu Corpus: [corpus/MODU_CORPUS](./corpus/MODU_CORPUS)
|
58 |
|
59 |
Final JSONL dataset to trian this model is: 61GB.
|
|
|
54 |
Trained with selected corpus within AIHub/Modu Corpus. The detailed dataset list to train this model is available below:
|
55 |
|
56 |
- AI Hub: [corpus/AI_HUB](./corpus/AI_HUB)
|
57 |
+
- Used only `Training` part of the data.
|
58 |
+
- Explicitly dropped `Validation`/`Test` part of the data.
|
59 |
- Modu Corpus: [corpus/MODU_CORPUS](./corpus/MODU_CORPUS)
|
60 |
|
61 |
Final JSONL dataset to trian this model is: 61GB.
|