Safetensors
English
olmo2
amanrangapur commited on
Commit
0fdca51
1 Parent(s): 66afa12

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -63,8 +63,8 @@ The quantized model is more sensitive to data types and CUDA operations. To avoi
63
  inputs.input_ids.to('cuda')
64
  ```
65
 
66
- We have released checkpoints for these models, for every 1000 training steps.
67
- The naming convention is `stepXXX-tokensYYYB`.
68
 
69
  To load a specific model revision with HuggingFace, simply add the argument `revision`:
70
  ```bash
 
63
  inputs.input_ids.to('cuda')
64
  ```
65
 
66
+ We have released checkpoints for these models. For pretraining, the naming convention is `stepXXX-tokensYYYB`. For checkpoints with ingredients of the soup, the naming convention is `stage2-ingredientN-stepXXX-tokensYYYB`
67
+
68
 
69
  To load a specific model revision with HuggingFace, simply add the argument `revision`:
70
  ```bash