team update
Browse files
README.md
CHANGED
@@ -209,7 +209,7 @@ model-index:
|
|
209 |
**Model Summary:**
|
210 |
Granite-3.0-3B-A800M-Base is a decoder-only language model to support a variety of text-to-text generation tasks. It is trained from scratch following a two-stage training strategy. In the first stage, it is trained on 8 trillion tokens sourced from diverse domains. During the second stage, it is further trained on 2 trillion tokens using a carefully curated mix of high-quality data, aiming to enhance its performance on specific tasks.
|
211 |
|
212 |
-
- **Developers:** IBM
|
213 |
- **GitHub Repository:** [ibm-granite/granite-3.0-language-models](https://github.com/ibm-granite/granite-3.0-language-models)
|
214 |
- **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
|
215 |
- **Paper:** [Granite 3.0 Language Models](https://github.com/ibm-granite/granite-3.0-language-models/blob/main/paper.pdf)
|
|
|
209 |
**Model Summary:**
|
210 |
Granite-3.0-3B-A800M-Base is a decoder-only language model to support a variety of text-to-text generation tasks. It is trained from scratch following a two-stage training strategy. In the first stage, it is trained on 8 trillion tokens sourced from diverse domains. During the second stage, it is further trained on 2 trillion tokens using a carefully curated mix of high-quality data, aiming to enhance its performance on specific tasks.
|
211 |
|
212 |
+
- **Developers:** Granite Team, IBM
|
213 |
- **GitHub Repository:** [ibm-granite/granite-3.0-language-models](https://github.com/ibm-granite/granite-3.0-language-models)
|
214 |
- **Website**: [Granite Docs](https://www.ibm.com/granite/docs/)
|
215 |
- **Paper:** [Granite 3.0 Language Models](https://github.com/ibm-granite/granite-3.0-language-models/blob/main/paper.pdf)
|