adamo1139 commited on
Commit
62d89d6
·
verified ·
1 Parent(s): e841320

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md ADDED
@@ -0,0 +1 @@
 
 
1
+ It's Danube3 500M with <|im_start|> and <|im_end|> tags trained in. First, one LoRA finetune on the uninstruct dataset, then I modified the vocabulary and ran a second LoRA, this time including embed_tokens and lm_head. Useful for quick hyperparameter optimization where you don't want to train lm_head and embed_tokens but you want to have <|im_start|> and <|im_end|> tokens trained in already.