izumi-lab/stormy-7b-10ep

retarfi committed on Jun 1, 2023

Commit a1bebd9 • 1 Parent(s): 738a4f7

Modify README.md

Files changed (1)

README.md +5 -4

@@ -5,7 +5,8 @@ datasets:
 language:
 - ja
 tags:
-- llama
+- gpt_neox
+- japanese
 - causal-lm
 ---
@@ -17,11 +18,11 @@ You can test this at https://huggingface.co/spaces/izumi-lab/stormy-7b-10ep
 This version of the weights was trained with the following hyperparameters:
 - Epochs: 10
-- Batch size: 130
-- Cutoff length: 256
+- Batch size: 128
+- Cutoff length: 300
 - Learning rate: 3e-4
 - Lora _r_: 4
-- Lora target modules: q_proj, v_proj
+- Lora target modules: query_key_value
 ```python
 import torch