smpanaro commited on
Commit
9e80c9f
·
verified ·
1 Parent(s): 469fa70

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -5,6 +5,7 @@ metrics:
5
  - perplexity
6
  base_model:
7
  - meta-llama/Llama-2-7b-hf
 
8
  ---
9
  **N**on-**u**niform **GPTQ** (NuGPTQ) combines [GPTQ](https://arxiv.org/abs/2210.17323), [SqueezeLLM](https://arxiv.org/abs/2306.07629) and [output scaling](https://stephenpanaro.com/blog/llm-quantization-for-iphone) for a competitive whole-tensor (no grouping) LLM compression method.
10
 
 
5
  - perplexity
6
  base_model:
7
  - meta-llama/Llama-2-7b-hf
8
+ quantized_by: smpanaro
9
  ---
10
  **N**on-**u**niform **GPTQ** (NuGPTQ) combines [GPTQ](https://arxiv.org/abs/2210.17323), [SqueezeLLM](https://arxiv.org/abs/2306.07629) and [output scaling](https://stephenpanaro.com/blog/llm-quantization-for-iphone) for a competitive whole-tensor (no grouping) LLM compression method.
11