Default to eager attention
pinned
2
#1 opened 5 months ago
by
lysandre
Which Chinese languages does this model speak?
#5 opened 2 days ago
by
glenng
Can anyone provider gptq-4bit or awq version for this model?
#4 opened 3 months ago
by
esoterikx
vllm load model error
1
#3 opened 4 months ago
by
Dharma0818
How much GPU memory is required to run this model?
#2 opened 4 months ago
by
Dharma0818