Update README.md
Browse files
README.md
CHANGED
@@ -5,7 +5,7 @@ metrics:
|
|
5 |
- perplexity
|
6 |
base_model:
|
7 |
- meta-llama/Llama-2-7b-hf
|
8 |
-
|
9 |
---
|
10 |
**N**on-**u**niform **GPTQ** (NuGPTQ) combines [GPTQ](https://arxiv.org/abs/2210.17323), [SqueezeLLM](https://arxiv.org/abs/2306.07629) and [output scaling](https://stephenpanaro.com/blog/llm-quantization-for-iphone) for a competitive whole-tensor (no grouping) LLM compression method.
|
11 |
|
|
|
5 |
- perplexity
|
6 |
base_model:
|
7 |
- meta-llama/Llama-2-7b-hf
|
8 |
+
base_model_relation: quantized
|
9 |
---
|
10 |
**N**on-**u**niform **GPTQ** (NuGPTQ) combines [GPTQ](https://arxiv.org/abs/2210.17323), [SqueezeLLM](https://arxiv.org/abs/2306.07629) and [output scaling](https://stephenpanaro.com/blog/llm-quantization-for-iphone) for a competitive whole-tensor (no grouping) LLM compression method.
|
11 |
|