nihalnayak committed
Commit 1e33e79 · Parent(s): a8e503d
Update README.md
README.md CHANGED
@@ -123,7 +123,7 @@ The training takes about 4 days on four GPUs to complete.
 
 We use the following hyperparameters:
 - Q-LoRA rank (r): 64
-- Q-LoRA scaling factor (
+- Q-LoRA scaling factor (alpha): 4
 - Q-LoRA dropout: 0
 - Optimizer: Paged AdamW
 - Learning rate scheduler: linear
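For reference, the corrected hyperparameter list corresponds roughly to the following fine-tuning configuration. This is a minimal sketch assuming the Hugging Face peft and transformers libraries; the output directory, the 32-bit Paged AdamW variant, and the bias/task_type settings are assumptions not stated in the README.

```python
from peft import LoraConfig
from transformers import TrainingArguments

# Q-LoRA adapter settings from the README's hyperparameter list.
lora_config = LoraConfig(
    r=64,              # Q-LoRA rank (r)
    lora_alpha=4,      # Q-LoRA scaling factor (alpha), fixed by this commit
    lora_dropout=0.0,  # Q-LoRA dropout
    bias="none",       # assumption: not specified in the README
    task_type="CAUSAL_LM",  # assumption: not specified in the README
)

# Optimizer and scheduler settings from the README's hyperparameter list.
training_args = TrainingArguments(
    output_dir="qlora-output",   # hypothetical path
    optim="paged_adamw_32bit",   # Paged AdamW (32-bit variant assumed)
    lr_scheduler_type="linear",  # linear learning-rate scheduler
)
```

Note that with r=64 and alpha=4, the effective LoRA scaling factor alpha/r is 1/16, so the adapter updates are scaled down substantially relative to the base weights.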