Unifying Specialized Visual Encoders for Video Language Models Paper • 2501.01426 • Published 21 days ago • 21
AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark Paper • 2410.03051 • Published Oct 4, 2024 • 6