Information about performance of different quantisation options
#1, opened by krumeto
Congratulations on a fantastic release!
Would you have some information on the performance of the GGUF options? It would be very helpful to know how much of the model quality is retained at different quantisation levels. Even anecdotal evidence would be sufficient.
Thank you in advance,
Krum