mgoin committed on
Commit
7883e52
1 Parent(s): c499065

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -14,7 +14,7 @@ This repo contains model files for [llama2.c 110M tinystories](https://huggingfa
  This model was pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).
 
  ## Inference
- Install [NM-vLLM](https://github.com/neuralmagic/nm-vllm) for fast inference:
+ Install [NM-vLLM](https://github.com/neuralmagic/nm-vllm) for fast inference and low memory-usage:
  ```bash
  pip install nm-vllm[sparse]
  ```