Models - Video
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper
• 2402.13217
• Published
• 38
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper
• 2402.17485
• Published
• 194
Text Generation
• Updated
• 125k
• 381
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper
• 2403.01422
• Published
• 30
World Model on Million-Length Video And Language With RingAttention
Paper
• 2402.08268
• Published
• 40
Valley: Video Assistant with Large Language model Enhanced abilitY
Paper
• 2306.07207
• Published
• 3
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Paper
• 2306.05424
• Published
• 7
Image-to-Video
• Updated
• 323k
• 2.11k
Text-to-Video
• Updated
• 5.62k
• 1.31k
Text-to-Video
• Updated
• 21
• 191
FastVideo/FastMochi-diffusers
Text-to-Video
• Updated
• 8
• 19
Text-to-Video
• Updated
• 1.09k
• 2.13k