kaizuberbuehler committed • Commit 853042c • Parent: 451dbc4

Update README.md

README.md CHANGED @@ -12,22 +12,22 @@ license: llama3
## Training Details

- Hardware: 1x RTX 4090
- Duration: 30 hours in total (2 hours for the first phase and 28 hours for the second phase)

### Hyperparameters

- Adapter: QLoRA
- Precision: 4-bit
- Optimizer: adamw_bnb_8bit
- LoRA Rank: 256
- LoRA Alpha: 256
- Learning Rate: 1e-5
- Context Length: 4096 tokens
- Batch Size: 1
- Gradient Accumulation Steps: 1
- Sample Packing: Off for the first phase, on for the second phase
- Epochs: 2
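As a rough sketch (not tied to whatever training framework was actually used), the hyperparameters above can be gathered into a plain Python dict; all key names below are illustrative, not framework-specific:

```python
# Hyperparameters from the Training Details section, collected into a
# plain dict for reference. Key names are illustrative only.
train_config = {
    "adapter": "qlora",
    "precision_bits": 4,
    "optimizer": "adamw_bnb_8bit",
    "lora_rank": 256,
    "lora_alpha": 256,
    "learning_rate": 1e-5,
    "context_length": 4096,
    "micro_batch_size": 1,
    "gradient_accumulation_steps": 1,
    "epochs": 2,
}

# Effective batch size is micro-batch size times accumulation steps;
# with both set to 1, every optimizer step uses a single sequence.
effective_batch = (
    train_config["micro_batch_size"]
    * train_config["gradient_accumulation_steps"]
)

# With sample packing on (second phase), each step therefore sees at
# most one packed sequence of context_length tokens.
max_tokens_per_step = effective_batch * train_config["context_length"]
```

Note that with LoRA alpha equal to the rank, the common `alpha / rank` scaling factor applied to the adapter output is 1.0.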
## Limitations