allenai/OLMo-2-1124-7B


amanrangapur committed 8 days ago

Commit 0fdca51 • 1 Parent(s): 66afa12

Update README.md

Files changed (1)

README.md +2 -2


@@ -63,8 +63,8 @@ The quantized model is more sensitive to data types and CUDA operations. To avoi
 inputs.input_ids.to('cuda')
 ```
-We have released checkpoints for these models, for every 1000 training steps.
-The naming convention is `stepXXX-tokensYYYB`.
+We have released checkpoints for these models. For pretraining, the naming convention is `stepXXX-tokensYYYB`. For checkpoints with ingredients of the soup, the naming convention is `stage2-ingredientN-stepXXX-tokensYYYB`
 To load a specific model revision with HuggingFace, simply add the argument `revision`:
 ```bash