Text Generation
Transformers
ONNX
llama
sparse
code
deepsparse
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -14,7 +14,7 @@ tags:
14
 
15
  # Llama-2-7b-pruned70-retrained-evolcodealpaca-quant-ds
16
 
17
- This repo contains a [70% sparse Llama 2 7B](https://huggingface.co/neuralmagic/Llama-2-7b-pruned70-retrained) finetuned for code generation tasks using the [Evolved CodeAlpaca](https://huggingface.co/datasets/theblackcat102/evol-codealpaca-v1) dataset.
18
  It was then quantized to 8-bit weights + activations and exported to deploy with [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
19
 
20
  **Authors**: Neural Magic, Cerebras
@@ -46,9 +46,9 @@ print(outputs.generations[0].text)
46
 
47
  Model evaluation metrics and results.
48
 
49
- | Benchmark | Metric | Llama-2-7b-instruct | Llama-2-7b-pruned70-retrained-evolcodealpaca-quant-ds |
50
  |------------------------------------------------|---------------|-------------|-------------------------------|
51
- | [HumanEval](https://arxiv.org/abs/2107.03374) | pass@1 | xxxx | xxxx |
52
 
53
  ## Help
54
 
 
14
 
15
  # Llama-2-7b-pruned70-retrained-evolcodealpaca-quant-ds
16
 
17
+ This repo contains a [70% sparse Llama 2 7B](https://huggingface.co/neuralmagic/Llama-2-7b-pruned70-retrained-evolcodealpaca) finetuned for code generation tasks using the [Evolved CodeAlpaca](https://huggingface.co/datasets/theblackcat102/evol-codealpaca-v1) dataset.
18
  It was then quantized to 8-bit weights + activations and exported to deploy with [DeepSparse](https://github.com/neuralmagic/deepsparse), a CPU inference runtime for sparse models.
19
 
20
  **Authors**: Neural Magic, Cerebras
 
46
 
47
  Model evaluation metrics and results.
48
 
49
+ | Benchmark | Metric | Llama-2-7b-evolcodealpaca | Llama-2-7b-pruned70-retrained-evolcodealpaca-quant-ds |
50
  |------------------------------------------------|---------------|-------------|-------------------------------|
51
+ | [HumanEval](https://arxiv.org/abs/2107.03374) | pass@1 | 32.03 | 34.76 |
52
 
53
  ## Help
54