Any guideline on how to quantatize to GPTQ 4bits?
#20
by
victorx98
- opened
how to get a 4bits quantatized version? Thanks
You need refer to AutoGPTQ github repo. I have tried, But 4bits-quantized effect is bad, I don't know why.
You need refer to AutoGPTQ github repo. I have tried, But 4bits-quantized effect is bad, I don't know why.
Thanks. Have you tried GGML quantization?
You need refer to AutoGPTQ github repo. I have tried, But 4bits-quantized effect is bad, I don't know why.
Thanks. Have you tried GGML quantization?
I am sorry, I haven't.
wuzhiying2023
changed discussion status to
closed