matlok's Collections
Models - Video
updated
VideoPrism: A Foundational Visual Encoder for Video Understanding
Paper • 2402.13217 • Published • 24
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper • 2402.17485 • Published • 191
Qwen/Qwen-VL-Chat
Text Generation • Updated • 25.7k • 350
MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Paper • 2403.01422 • Published • 27
World Model on Million-Length Video And Language With RingAttention
Paper • 2402.08268 • Published • 38
Valley: Video Assistant with Large Language model Enhanced abilitY
Paper • 2306.07207 • Published • 2
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
Paper • 2306.05424 • Published • 7
Lightricks/LTX-Video
Image-to-Video • Updated • 85k • 907
genmo/mochi-1-preview
Text-to-Video • Updated • 39.9k • 1.16k
FastVideo/FastHunyuan
Text-to-Video • Updated • 896 • 163
FastVideo/FastMochi-diffusers
Text-to-Video • Updated • 99 • 16
tencent/HunyuanVideo
Text-to-Video • Updated • 7.62k • 1.54k