nm-testing
/

Llama-2-7b-hf-pruned50-quant-ds

Text Generation

Model card Files Files and versions Community

mwitiderrick commited on Dec 20, 2023

Commit

b55e5b3

•

1 Parent(s): 58516fe

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -10,7 +10,7 @@ quantized_by: mwitiderrick
 tags:
 - deepsparse
 ---
-# Nous-Hermes-Llama2-7b - DeepSparse
 This repo contains model files for [Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) optimized for [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
 This model was quantized and pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).

 tags:
 - deepsparse
 ---
+# Llama2-7b - DeepSparse
 This repo contains model files for [Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) optimized for [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
 This model was quantized and pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).