Will you release any quantized versions? I would like to see a GPTQ quant as well.
I think they will be coming soon.
Yes, please! I cannot load the 14 GB model into memory directly.