FastVLM: Efficient Vision Encoding for Vision Language Models
Paper
• 2412.13303 • Published
• 75
Efficient Vision Encoding for Vision Language Models
Real-time video captioning powered by FastVLM
Note MLX checkpoint
Note MLX checkpoint
Note MLX checkpoint