Update README.md
README.md
CHANGED
@@ -18,7 +18,7 @@ The Meta Llama 3.1 collection of multilingual large language models (LLMs) is a
 
 # Quantized Model
 
-This repo contains the meta-llama/Meta-Llama-3.1-8B quantized down to INT4 with AutoAWQ using GEMM kernels performing zero-point quantization with a group size of 128.
+This is the quantizied version of the BASE model of LLama-3.1-8B. This repo contains the meta-llama/Meta-Llama-3.1-8B quantized down to INT4 with AutoAWQ using GEMM kernels performing zero-point quantization with a group size of 128.
 
 # Usage Requirements
 
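For readers unfamiliar with the terms in the changed line, the following is a minimal pure-Python sketch of what "zero-point quantization with a group size of 128" at INT4 means. It is an illustration only, not AutoAWQ's implementation: the function names `quantize_group`, `dequantize_group`, and `quantize_row` are hypothetical, and the sketch leaves out AWQ's activation-aware scaling, weight packing, and the GEMM kernels.

```python
def quantize_group(weights, n_bits=4):
    """Asymmetric (zero-point) quantization of one group of float weights.

    Hypothetical helper for illustration; not part of AutoAWQ.
    """
    q_max = 2 ** n_bits - 1              # 15 for INT4, so values live in [0, 15]
    w_min, w_max = min(weights), max(weights)
    scale = (w_max - w_min) / q_max
    if scale == 0.0:                     # constant group: avoid dividing by zero
        scale = 1.0
    # The zero point is the integer that represents the real value 0.0.
    zero_point = min(q_max, max(0, round(-w_min / scale)))
    q = [min(q_max, max(0, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point


def dequantize_group(q, scale, zero_point):
    """Map the integers back to approximate float weights."""
    return [(v - zero_point) * scale for v in q]


def quantize_row(row, group_size=128, n_bits=4):
    """Quantize a row of weights; each group keeps its own scale and zero point."""
    return [
        quantize_group(row[i:i + group_size], n_bits)
        for i in range(0, len(row), group_size)
    ]
```

Each group of 128 weights stores its own scale and zero point, so an outlier in one group does not reduce the precision available to the others. The cost is a small amount of extra metadata per group.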