vllm / sglang support?

#2
by CHNtentes - opened

It's painfully slow to run MoE models with transformers...

Sign up or log in to comment