victormiller
commited on
Commit
•
af15a69
1
Parent(s):
4971892
Update README.md
Browse files
README.md
CHANGED
@@ -2,7 +2,7 @@
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
# LLM360 Research Suite: K2 Loss Spike 1
|
5 |
-
We encountered two major loss spikes while training K2.
|
6 |
* The first loss spike occured after X checkpoints and lasted over ~34 checkpoints. We restarted training at checkpoint X and training returned to normal.
|
7 |
* The [second loss spike](https://huggingface.co/LLM360/K2-Spike-2/) occured after restarting training to fix the first loss spike at checkpoint X and lasted from ~8 checkpoints.
|
8 |
|
|
|
2 |
license: apache-2.0
|
3 |
---
|
4 |
# LLM360 Research Suite: K2 Loss Spike 1
|
5 |
+
We encountered two major loss spikes while [training K2](https://huggingface.co/LLM360/K2).
|
6 |
* The first loss spike occured after X checkpoints and lasted over ~34 checkpoints. We restarted training at checkpoint X and training returned to normal.
|
7 |
* The [second loss spike](https://huggingface.co/LLM360/K2-Spike-2/) occured after restarting training to fix the first loss spike at checkpoint X and lasted from ~8 checkpoints.
|
8 |
|