Nobody felt like quantizing this model?

#2
by ElvisM - opened

Weird. Usually GGUF quants pop out in the first hour.

It's a new architecture, so it'll take time for the popular inference engines and quant libs to be updated to support it.

Looks like there's an MLX pull request with support, if you have a Mac:

https://github.com/ml-explore/mlx-examples/pull/1157
