Update README.md
Browse files
README.md
CHANGED
@@ -18,10 +18,6 @@ To download Zamba2-2.7B-instruct, clone Zyphra's fork of transformers:
|
|
18 |
4. `pip install accelerate`
|
19 |
|
20 |
|
21 |
-
You can run the model without using the optimized Mamba2 kernels, but it is **not** recommended as it will result in significantly higher latency and memory usage.
|
22 |
-
|
23 |
-
To run on CPU, please specify `use_mamba_kernels=False` when loading the model using ``AutoModelForCausalLM.from_pretrained``.
|
24 |
-
|
25 |
### Inference
|
26 |
|
27 |
```python
|
|
|
18 |
4. `pip install accelerate`
|
19 |
|
20 |
|
|
|
|
|
|
|
|
|
21 |
### Inference
|
22 |
|
23 |
```python
|