Quantizedd size warning

#9
by bayang - opened

Is it normal to have the quantized bigger than the .safetensors files?

Google org

The GGUF is not necessarily a quantized file. It's likely a full-precision checkpoint while the safetensors is half precision

Gotcha, thanks

bayang changed discussion status to closed

Sign up or log in to comment