About the W8A8 method

#1
by a-r-c - opened

Which quantization method did you use?

GPTQ Int8 for weights.

Is this padded for multi-GPU use?
