Quantizations version
#5 opened 5 days ago
by
baiall

NuMarkdown-8B-reasoning on A100 40GB is extremely slow (even for 1 token)
๐
1
1
#4 opened 11 days ago
by
Fedoration
This is so exciting!
๐ค
5
1
#2 opened 30 days ago
by
ashercn97