mwitiderrick
commited on
Commit
•
b55e5b3
1
Parent(s):
58516fe
Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ quantized_by: mwitiderrick
|
|
10 |
tags:
|
11 |
- deepsparse
|
12 |
---
|
13 |
-
#
|
14 |
This repo contains model files for [Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) optimized for [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
|
15 |
|
16 |
This model was quantized and pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).
|
|
|
10 |
tags:
|
11 |
- deepsparse
|
12 |
---
|
13 |
+
# Llama2-7b - DeepSparse
|
14 |
This repo contains model files for [Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) optimized for [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
|
15 |
|
16 |
This model was quantized and pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).
|