Default to eager attention
pinned
2
#1 opened 7 months ago
by
lysandre
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5e3aec01f55e2b62848a5217/PMKS0NNB4MJQlTSFzh918.jpeg)
Which Chinese languages does this model speak?
#5 opened 3 months ago
by
glenng
![](https://cdn-avatars.huggingface.co/v1/production/uploads/64fdf76e01aedd0e8600c70a/md4DRT8hFymwX_6_rGLx5.jpeg)
Can anyone provider gptq-4bit or awq version for this model?
#4 opened 5 months ago
by
esoterikx
vllm load model error
1
#3 opened 7 months ago
by
Dharma0818
How much GPU memory is required to run this model?
#2 opened 7 months ago
by
Dharma0818