mwitiderrick committed
Commit 005d074 (1 parent: 263e235)
Update README.md
README.md
CHANGED
@@ -10,7 +10,7 @@ quantized_by: mwitiderrick
 tags:
 - deepsparse
 ---
-## TinyLlama 1.1B Chat 0
+## TinyLlama 1.1B Chat 1.0 - DeepSparse
 This repo contains model files for [TinyLlama 1.1B Chat](https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0) optimized for [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
 
 This model was quantized and pruned with [SparseGPT](https://arxiv.org/abs/2301.00774), using [SparseML](https://github.com/neuralmagic/sparseml).