smpanaro commited on
Commit
ee1bada
1 Parent(s): 9e80c9f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -5,7 +5,7 @@ metrics:
5
  - perplexity
6
  base_model:
7
  - meta-llama/Llama-2-7b-hf
8
- quantized_by: smpanaro
9
  ---
10
  **N**on-**u**niform **GPTQ** (NuGPTQ) combines [GPTQ](https://arxiv.org/abs/2210.17323), [SqueezeLLM](https://arxiv.org/abs/2306.07629) and [output scaling](https://stephenpanaro.com/blog/llm-quantization-for-iphone) for a competitive whole-tensor (no grouping) LLM compression method.
11
 
 
5
  - perplexity
6
  base_model:
7
  - meta-llama/Llama-2-7b-hf
8
+ base_model_relation: quantized
9
  ---
10
  **N**on-**u**niform **GPTQ** (NuGPTQ) combines [GPTQ](https://arxiv.org/abs/2210.17323), [SqueezeLLM](https://arxiv.org/abs/2306.07629) and [output scaling](https://stephenpanaro.com/blog/llm-quantization-for-iphone) for a competitive whole-tensor (no grouping) LLM compression method.
11