Is it using ggml to compute?
#30, opened by CHNtentes
Or does it just dequantize from GGUF and then use transformers?
We're just using it as a storage format, so we dequantize on the fly and use the code in ComfyUI (which is why it's the reference format, not the diffusers one). Using the ggml.dll kernels would be nice, though.
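To illustrate what "storage format plus dequant on the fly" can look like, here is a minimal NumPy sketch for the Q8_0 quantization type, where each block of 32 weights is stored as one fp16 scale followed by 32 int8 values. This is not the code from this repo or from ComfyUI; the helper names `quantize_q8_0` and `dequantize_q8_0` are made up for the example, and a real loader would read the blocks out of a GGUF file rather than build them in memory.

```python
import numpy as np

QK8_0 = 32                 # weights per Q8_0 block
BLOCK_BYTES = 2 + QK8_0    # one fp16 scale (2 bytes) + 32 int8 values


def quantize_q8_0(weights: np.ndarray) -> bytes:
    """Pack float weights into Q8_0-style blocks (hypothetical helper)."""
    blocks = weights.astype(np.float32).reshape(-1, QK8_0)
    amax = np.abs(blocks).max(axis=1, keepdims=True)
    scale = (amax / 127.0).astype(np.float16)          # per-block scale
    # Avoid division by zero for all-zero blocks.
    safe = np.where(scale == 0, 1, scale).astype(np.float32)
    q = np.clip(np.round(blocks / safe), -127, 127).astype(np.int8)

    out = np.empty((blocks.shape[0], BLOCK_BYTES), dtype=np.uint8)
    out[:, :2] = scale.view(np.uint8)                  # raw fp16 bytes
    out[:, 2:] = q.view(np.uint8)                      # raw int8 bytes
    return out.tobytes()


def dequantize_q8_0(raw: bytes, shape) -> np.ndarray:
    """Unpack Q8_0-style blocks back to float32 (hypothetical helper)."""
    blocks = np.frombuffer(raw, dtype=np.uint8).reshape(-1, BLOCK_BYTES)
    scale = blocks[:, :2].copy().view(np.float16).astype(np.float32)
    q = blocks[:, 2:].copy().view(np.int8).astype(np.float32)
    return (scale * q).reshape(shape)                  # x = scale * q


if __name__ == "__main__":
    w = np.linspace(-1.0, 1.0, 64, dtype=np.float32).reshape(2, 32)
    raw = quantize_q8_0(w)
    w_hat = dequantize_q8_0(raw, w.shape)
    print(len(raw), float(np.abs(w - w_hat).max()))
```

In this setup the dequantized tensor is handed to the regular float layers, so the compute path is the same as for an unquantized model. Only the weights at rest are smaller.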