rpand002 commited on
Commit
3fd1f32
1 Parent(s): 174c7f4

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -8,6 +8,8 @@ tags:
8
 
9
  # Granite-3.1-8B-Base
10
 
 
 
11
  **Model Summary:**
12
  Granite-3.1-8B-Base extends the context length of Granite-3.0-8B-Base from 4K to 128K using a progressive training strategy by increasing the supported context length in increments while adjusting RoPE theta until the model has successfully adapted to desired length of 128K. This long-context pre-training stage was performed using approximately 500B tokens.
13
 
 
8
 
9
  # Granite-3.1-8B-Base
10
 
11
+ **Note: We are continuously improving our models and recommend users to checkout our latest [Granite 3.1](https://huggingface.co/collections/ibm-granite/granite-31-language-models-6751dbbf2f3389bec5c6f02d) models.**
12
+
13
  **Model Summary:**
14
  Granite-3.1-8B-Base extends the context length of Granite-3.0-8B-Base from 4K to 128K using a progressive training strategy by increasing the supported context length in increments while adjusting RoPE theta until the model has successfully adapted to desired length of 128K. This long-context pre-training stage was performed using approximately 500B tokens.
15