pkupie/Llama-2-7b-bod


KobayashiKanna01 committed 13 days ago

Commit bdea023 • 1 Parent(s): c2f91b5

Update README.md

Files changed (1)

README.md +12 -1


@@ -7,4 +7,15 @@ language:
 - bo
 base_model:
 - meta-llama/Llama-2-7b-hf
----

+---
+A continually pre-trained model based on Llama-2-7b-hf.
+We use the **Tibetan texts** in MC^2 and **English texts** in RedPajama with a proportion of **4:1** for training.
+#### Hyper-parameters:
+ * lr: 3e-5
+ * batch size: 1M (2K*512)
+ * lr scheduler: cosine
+ * min lr: 1e-6
+ * lr decay iters: 10240
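
The hyper-parameters added in this commit can be read as a cosine learning-rate schedule plus a token budget per batch. The sketch below is only an illustration of that reading, not the authors' training code: it assumes no warmup (none is listed), assumes the rate is held at `min lr` once `lr decay iters` is reached, and assumes "2K*512" means 2048-token sequences times 512 sequences per batch. The function name `cosine_lr` and all constant names are hypothetical.

```python
import math

# Values taken from the README hyper-parameter list.
PEAK_LR = 3e-5        # lr
MIN_LR = 1e-6         # min lr
DECAY_ITERS = 10240   # lr decay iters

# Assumed reading of "batch size: 1M (2K*512)".
SEQ_LEN = 2048        # assumption: "2K" is the sequence length in tokens
SEQS_PER_BATCH = 512  # assumption: "512" is the number of sequences per batch


def cosine_lr(step: int) -> float:
    """Cosine decay from PEAK_LR to MIN_LR over DECAY_ITERS steps.

    Assumes no warmup and a constant MIN_LR after the decay window;
    neither detail is stated in the README.
    """
    if step >= DECAY_ITERS:
        return MIN_LR
    progress = step / DECAY_ITERS
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return MIN_LR + (PEAK_LR - MIN_LR) * cosine


# Tokens per optimizer step under the assumed reading of the batch size.
tokens_per_batch = SEQ_LEN * SEQS_PER_BATCH

if __name__ == "__main__":
    for step in (0, DECAY_ITERS // 2, DECAY_ITERS):
        print(f"step {step:>5}: lr = {cosine_lr(step):.3e}")
    print(f"tokens per batch: {tokens_per_batch}")
```

Under these assumptions the per-step token count is 2048 × 512 = 1,048,576, which matches the "1M" figure in the list.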