Vision-Language
moonshotai/Kimi-VL-A3B-Thinking • Image-Text-to-Text • 16B • Updated about 1 month ago • 85.6k • 446
OpenGVLab/InternViT-300M-448px-V2_5 • Image Feature Extraction • 0.3B • Updated Dec 9, 2024 • 3.56k • 48
LifuWang/DistillT5 • 0.1B • Updated Apr 11, 2025 • 163 • 29