ibm-granite
/

granite-3.1-8b-base

Text Generation

Inference Endpoints

Model card Files Files and versions Community

rpand002 commited on Dec 19, 2024

Commit

3fd1f32

·

verified ·

1 Parent(s): 174c7f4

Update README.md

Files changed (1) hide show

README.md +2 -0

README.md CHANGED Viewed

@@ -8,6 +8,8 @@ tags:
 # Granite-3.1-8B-Base
 **Model Summary:**
 Granite-3.1-8B-Base extends the context length of Granite-3.0-8B-Base from 4K to 128K using a progressive training strategy by increasing the supported context length in increments while adjusting RoPE theta until the model has successfully adapted to desired length of 128K. This long-context pre-training stage was performed using approximately 500B tokens.

 # Granite-3.1-8B-Base
+**Note: We are continuously improving our models and recommend users to checkout our latest [Granite 3.1](https://huggingface.co/collections/ibm-granite/granite-31-language-models-6751dbbf2f3389bec5c6f02d) models.**
 **Model Summary:**
 Granite-3.1-8B-Base extends the context length of Granite-3.0-8B-Base from 4K to 128K using a progressive training strategy by increasing the supported context length in increments while adjusting RoPE theta until the model has successfully adapted to desired length of 128K. This long-context pre-training stage was performed using approximately 500B tokens.