microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition • Updated about 22 hours ago • 441k • 1.12k
24GB VRAM Optimal Quants Collection When asked what I use locally on a 24GB card, this is what I point to. I favor exl2s for long context, GGUF for very short context. • 12 items • Updated Oct 31, 2024 • 3