Update README.md
Browse files
README.md
CHANGED
@@ -18,12 +18,14 @@ This repo contains model files for a [Llama 2 7B](https://huggingface.co/meta-ll
|
|
18 |
|
19 |
Below we share some code snippets on how to get quickly started with running the model.
|
20 |
|
21 |
-
### Fine-tuning examples
|
22 |
|
23 |
Coming soon.
|
24 |
|
25 |
### Running the model
|
26 |
|
|
|
|
|
27 |
```python
|
28 |
# pip install transformers accelerate
|
29 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|
|
|
18 |
|
19 |
Below we share some code snippets on how to get quickly started with running the model.
|
20 |
|
21 |
+
### Sparse Fine-tuning examples
|
22 |
|
23 |
Coming soon.
|
24 |
|
25 |
### Running the model
|
26 |
|
27 |
+
This model has not been fine-tuned for instruction-following but may be run with the transformers library. For accelerated inference with sparsity, deploy with [nm-vllm](https://github.com/neuralmagic/nm-vllm) or [deepsparse](https://github.com/neuralmagic/deepsparse).
|
28 |
+
|
29 |
```python
|
30 |
# pip install transformers accelerate
|
31 |
from transformers import AutoTokenizer, AutoModelForCausalLM
|