amanrangapur
commited on
Commit
•
0fdca51
1
Parent(s):
66afa12
Update README.md
Browse files
README.md
CHANGED
@@ -63,8 +63,8 @@ The quantized model is more sensitive to data types and CUDA operations. To avoi
|
|
63 |
inputs.input_ids.to('cuda')
|
64 |
```
|
65 |
|
66 |
-
We have released checkpoints for these models,
|
67 |
-
|
68 |
|
69 |
To load a specific model revision with HuggingFace, simply add the argument `revision`:
|
70 |
```bash
|
|
|
63 |
inputs.input_ids.to('cuda')
|
64 |
```
|
65 |
|
66 |
+
We have released checkpoints for these models. For pretraining, the naming convention is `stepXXX-tokensYYYB`. For checkpoints with ingredients of the soup, the naming convention is `stage2-ingredientN-stepXXX-tokensYYYB`
|
67 |
+
|
68 |
|
69 |
To load a specific model revision with HuggingFace, simply add the argument `revision`:
|
70 |
```bash
|