Multimodal GGUFs • Collection • Vision and audio models compatible with llama-server and llama-mtmd-cli • 14 items • Updated 6 days ago • 13
facebook/dinov3-vitl16-pretrain-lvd1689m • Image Feature Extraction • 0.3B • Updated Aug 19 • 305k • 55
google/vit-base-patch16-224-in21k • Image Feature Extraction • 86.4M • Updated Feb 5, 2024 • 1.89M • 383