Any guideline on how to quantatize to GPTQ 4bits?

#20
by victorx98 - opened

how to get a 4bits quantatized version? Thanks

Baichuan Intelligent Technology org

You need refer to AutoGPTQ github repo. I have tried, But 4bits-quantized effect is bad, I don't know why.

You need refer to AutoGPTQ github repo. I have tried, But 4bits-quantized effect is bad, I don't know why.

Thanks. Have you tried GGML quantization?

Baichuan Intelligent Technology org

You need refer to AutoGPTQ github repo. I have tried, But 4bits-quantized effect is bad, I don't know why.

Thanks. Have you tried GGML quantization?

I am sorry, I haven't.

wuzhiying2023 changed discussion status to closed

Sign up or log in to comment