ajibawa-2023
commited on
Commit
•
b177abf
1
Parent(s):
0f74d6f
Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ This is Fully Finetuned Model. Quantize models will be available soon.
|
|
25 |
Publishing anything this model generates is the same as publishing it yourself. I am not responsible for what you generate using this model.
|
26 |
|
27 |
**Training:**
|
28 |
-
Entire dataset was trained on
|
29 |
|
30 |
**GGUF & Exllama**
|
31 |
|
|
|
25 |
Publishing anything this model generates is the same as publishing it yourself. I am not responsible for what you generate using this model.
|
26 |
|
27 |
**Training:**
|
28 |
+
Entire dataset was trained on 4 x A100 80GB. For 3 epoch, training took around 6 hours. Axolotl & DeepSpeed codebase was used for training purpose. This was trained on Llama-3-8B model by Meta.
|
29 |
|
30 |
**GGUF & Exllama**
|
31 |
|