libkazz
/

llm-jp-3-13b-it_lora

text-generation-inference

Model card Files Files and versions Community

libkazz commited on Dec 17, 2024

Commit

9a26811

·

verified ·

1 Parent(s): f4b9fb8

Update README.md

Files changed (1) hide show

README.md +1 -3

README.md CHANGED Viewed

@@ -100,9 +100,7 @@ def select_best_response(file_path, output_dir):
 * 4bit量子化
 * LoRAによるSFT
 * learning_rate = 2e-4
-* optim="adamw_torch_fused"
-* lr_scheduler_type="cosine"
-* weight_decay=0.01
 ## Bias, Risks, and Limitations
 RLHF，DPOを実施していないため不適切な表現が出力される可能性があります。

 * 4bit量子化
 * LoRAによるSFT
 * learning_rate = 2e-4
+* num_train_epochs = 2
 ## Bias, Risks, and Limitations
 RLHF，DPOを実施していないため不適切な表現が出力される可能性があります。