Update README.md
README.md CHANGED

````diff
@@ -8,19 +8,18 @@ This is a recipe of int4 model with group_size 128 for [meta-llama/Meta-Llama-3.
 
 ## Reproduce the model
 
-
+This is an outdated recipe. We recommend using symmetric quantization by removing '--asym'
 
 ```bash
-
-
-pip install -r requirements.txt
-python3 main.py \
+
+auto-round \
 --model_name meta-llama/Meta-Llama-3.1-8B-Instruct \
 --device 0 \
 --group_size 128 \
 --bits 4 \
 --nsamples 512 \
 --iters 1000 \
+--asym \
 --model_dtype "fp16" \
 --deployment_device 'auto_round' \
 --eval_bs 16 \
````
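The added note recommends symmetric quantization, obtained by removing '--asym'. As a minimal sketch, the recommended invocation would be the same auto-round command without that flag. All flags are copied from the hunk; the trailing backslash after `--eval_bs 16 \` suggests the original command continues with further options not shown in this hunk, and the assumption that the CLI is installed via `pip install auto-round` is ours, not stated in the diff.

```bash
# Assumption: the auto-round CLI is available, e.g. via `pip install auto-round`
# (the diff drops the old `pip install -r requirements.txt` step without a replacement).
#
# Sketch of the symmetric recipe suggested by the note: the new command from the
# diff with --asym removed. The source hunk ends at `--eval_bs 16 \`, so any
# flags beyond that point are not reproduced here.
auto-round \
--model_name meta-llama/Meta-Llama-3.1-8B-Instruct \
--device 0 \
--group_size 128 \
--bits 4 \
--nsamples 512 \
--iters 1000 \
--model_dtype "fp16" \
--deployment_device 'auto_round' \
--eval_bs 16
```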