-
PEEKABOO: Interactive Video Generation via Masked-Diffusion
Paper • 2312.07509 • Published • 7 -
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Paper • 2403.14773 • Published • 10 -
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding
Paper • 2403.15377 • Published • 22 -
Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition
Paper • 2403.14148 • Published • 18
Goodman
Motolov
·
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 2 months ago
From Generalist to Specialist: Adapting Vision Language Models via
Task-Specific Visual Instruction Tuning
Organizations
Collections
7
models
None public yet
datasets
None public yet