Update README.md
Commit 422e74f by tingyuansen (1 parent: 84601cb)

README.md (changed)
@@ -31,7 +31,7 @@ AstroLLaMA-2-70B-Base_AIC is a specialized base language model for astronomy, de
 - Cosine decay schedule for learning rate reduction
 - Training duration: 1 epoch (approximately 2,000 A100 GPU hours)
 - **Primary Use**: Next token prediction for astronomy-related text generation and analysis
-- **Reference**: Pan et al. 2024
+- **Reference**: [Pan et al. 2024](https://arxiv.org/abs/2409.19750)
 
 ## Generating text from a prompt
 