getting very low tokens per second (under 1 t/s) on M2 Ultra 192GB.
#6 opened 4 months ago
by
j4ys0n
vLLM: Unknwon quantization method
#5 opened 5 months ago
by
yaronr
Update README.md
#4 opened 5 months ago
by
manitonga
Upload folder using huggingface_hub
2
#1 opened 5 months ago
by
schroneko
