victormiller
committed on
Commit d01d003
1 Parent(s): 0307054
Update README.md
README.md
CHANGED
@@ -6,6 +6,9 @@ During the first K2 training phase, we encountered two loss spikes.
 
 <img src="k2_spike_1.png" alt="k2 spike 1"/>
 
+## Uses
+Loss spikes are still a relatively unknown phenomena. By making these spikes and associated training details available, we hope others use these artifacts to further the worlds knowledge on this topic.
+
 ## About the LLM360 Research Suite
 The LLM360 Research Suite is a comprehensive set of large language model (LLM) artifacts from Amber, CrystalCoder, and K2 for academic and industry researchers to explore LLM training dynamics. Additional resources can be found at llm360.ai.
 