llm-jp
/

llm-jp-13b-instruct-lora-jaster-dolly-oasst-v1.0

Text Generation

Model card Files Files and versions Community

losyer8 commited on Oct 20, 2023

Commit

8143ca6

•

1 Parent(s): 5c419f1

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -124,7 +124,7 @@ The models have been pre-trained using a blend of the following datasets.
 |Codes|[The Stack](https://huggingface.co/datasets/bigcode/the-stack)|10B
 The pre-training was continuously conducted using a total of 10 folds of non-overlapping data, each consisting of approximately 27-28B tokens.
-We finalized the pre-training with additional (potentially) high-quality 27B tokens data obtained from the identical source data sets listed above used for the 10-fold data.
 ### Instruction tuning

 |Codes|[The Stack](https://huggingface.co/datasets/bigcode/the-stack)|10B
 The pre-training was continuously conducted using a total of 10 folds of non-overlapping data, each consisting of approximately 27-28B tokens.
+We finalized the pre-training with additional (potentially) high-quality 27B tokens data obtained from the identical source datasets listed above used for the 10-fold data.
 ### Instruction tuning