Update README.md
README.md
CHANGED
@@ -16,10 +16,11 @@ tags:
 
 # Academic Trurl 2 -- Polish Llama 2
 
-The
+The Academic TRURL is a finetuned Llama 2, trained on over 1.7b tokens (855k conversational **Polish** and **English** samples) with a large context of 4096 tokens.
 TRURL was trained on a large number of Polish data.
+
 TRURL 2 is a collection of fine-tuned generative text models with 7 billion and 13 billion parameters.
-This is the repository for the 13B fine-tuned model, optimized for dialogue use cases.
+This is the repository for the Academic 13B fine-tuned model, optimized for dialogue use cases.
 This model was trained without MMLU dataset.
 
 
@@ -37,9 +38,9 @@ This model was trained without MMLU dataset.
 
 ||Training Data|Params|Content Length|Num. Samples|Num. Tokens|start LR|
 |---|---|---|---|---|---|---|
-|Trurl 2|*A new mix of private and publicly available online data without MMLU*|7B|4k|
+|Trurl 2|*A new mix of private and publicly available online data without MMLU*|7B|4k|855k|1.19b|2.0 x 10<sup>-5</sup>|
 |Trurl 2|*A new mix of private and publicly available online data with MMLU*|13B|4k|970k|1.7b|2.0 x 10<sup>-5</sup>|
-|Trurl 2 Academic|*A new mix of private and publicly available online data without MMLU*|13B|4k|
+|Trurl 2 Academic|*A new mix of private and publicly available online data without MMLU*|13B|4k|855k|1.19b|2.0 x 10<sup>-5</sup>|
 
 ## Training data
 
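For context, a minimal sketch of how the dialogue model described above could be loaded and queried with the Hugging Face `transformers` library. The repository id `Voicelab/trurl-2-13b-academic` and the Llama-2-style `[INST]` prompt wrapper are assumptions, not stated in this diff.

```python
# Minimal usage sketch (assumptions: repo id "Voicelab/trurl-2-13b-academic",
# Llama-2-style [INST] chat formatting; requires transformers, torch, accelerate).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "Voicelab/trurl-2-13b-academic"  # assumed repository name

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.float16,  # half precision to fit the 13B weights
    device_map="auto",          # place layers on available GPUs automatically
)

# Llama-2-style instruction wrapper; a Polish prompt, since the model targets Polish.
prompt = "[INST] Kim był Mikołaj Kopernik? [/INST]"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256, do_sample=False)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```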