Update README.md
README.md (CHANGED)
````diff
@@ -18,7 +18,7 @@ This version of the weights was trained with the following hyperparameters:
 - Batch size: 128
 - Micro batch size: 4
 - Learning rate: 3e-4
-- Lora _r_:
+- Lora _r_: 8
 - Lora target modules: query_key_value
 
 You can reproduce using this repository:
@@ -29,10 +29,9 @@ Make sure you install requirements and finetune using this command using the fol
 
 ```
 python finetune.py \
-
-
-
-
-
-    --micro-batch-size=4
+    --base-model tiiuae/falcon-40b --lora-target-modules query_key_value \
+    --data-path sahil2801/CodeAlpaca-20k --output-dir ./lora-alpaca-code \
+    --batch-size 128 --micro-batch-size 4 --eval-limit 45 \
+    --eval-file code_eval.jsonl --wandb-project jerboa --wandb-log-model \
+    --wandb-watch gradients --num-epochs 2
 ```
````
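For context on the numbers in this diff: a global batch size of 128 with a micro batch size of 4 implies 128 / 4 = 32 gradient-accumulation steps. Below is a minimal sketch of an equivalent LoRA setup, assuming `finetune.py` maps its flags onto HuggingFace `transformers` + `peft` in the usual way; that mapping, and anything not shown in the diff (e.g. the single-device assumption), is a guess, not the repository's actual code.

```python
# Illustrative sketch only -- NOT the repository's finetune.py.
# Assumes the CLI flags above map onto HuggingFace transformers + peft as shown.
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM, TrainingArguments

# --base-model tiiuae/falcon-40b (Falcon required trust_remote_code at the time)
model = AutoModelForCausalLM.from_pretrained("tiiuae/falcon-40b", trust_remote_code=True)

# Lora _r_: 8, Lora target modules: query_key_value (both from the README)
lora_config = LoraConfig(
    r=8,
    target_modules=["query_key_value"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)

# Batch size 128 at micro batch size 4 -> 128 / 4 = 32 accumulation steps
# (assuming a single device; with N devices the steps divide by N).
training_args = TrainingArguments(
    output_dir="./lora-alpaca-code",   # --output-dir
    per_device_train_batch_size=4,     # --micro-batch-size 4
    gradient_accumulation_steps=32,
    learning_rate=3e-4,                # Learning rate: 3e-4
    num_train_epochs=2,                # --num-epochs 2
    report_to="wandb",                 # --wandb-project jerboa / --wandb-log-model
)
```

The `--wandb-watch gradients` flag plausibly corresponds to the `WANDB_WATCH=gradients` behavior of the transformers W&B integration, which logs gradient histograms during training.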