ZeroCool94
commited on
Commit
•
110f60a
1
Parent(s):
9790fc9
Update README.md
Browse files
README.md
CHANGED
@@ -79,10 +79,10 @@ The model was trained on the following dataset:
|
|
79 |
- **Hardware:** 1 x Nvidia RTX 3050 8GB GPU
|
80 |
- **Hours Trained:** 520 hours approximately.
|
81 |
- **Optimizer:** AdamW
|
82 |
-
- **Gradient Accumulations**:
|
83 |
- **Batch:** 1
|
84 |
- **Learning rate:** warmup to 1e-7 for 10,000 steps and then kept constant
|
85 |
-
- **Total Training Steps:** 1,
|
86 |
|
87 |
Developed by: [ZeroCool94](https://huggingface.co/ZeroCool94) at [Sygil-Dev](https://github.com/Sygil-Dev/)
|
88 |
|
|
|
79 |
- **Hardware:** 1 x Nvidia RTX 3050 8GB GPU
|
80 |
- **Hours Trained:** 520 hours approximately.
|
81 |
- **Optimizer:** AdamW
|
82 |
+
- **Gradient Accumulations**: 4
|
83 |
- **Batch:** 1
|
84 |
- **Learning rate:** warmup to 1e-7 for 10,000 steps and then kept constant
|
85 |
+
- **Total Training Steps:** 1,489,983
|
86 |
|
87 |
Developed by: [ZeroCool94](https://huggingface.co/ZeroCool94) at [Sygil-Dev](https://github.com/Sygil-Dev/)
|
88 |
|