AnyStory: Towards Unified Single and Multiple Subject Personalization in Text-to-Image Generation Paper • 2501.09503 • Published 15 days ago • 13
Do generative video models learn physical principles from watching videos? Paper • 2501.09038 • Published 16 days ago • 31
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 16 days ago • 61
EMO2: End-Effector Guided Audio-Driven Avatar Video Generation Paper • 2501.10687 • Published 13 days ago • 12
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Paper • 2501.12224 • Published 10 days ago • 46
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 21 days ago • 87
Multi-subject Open-set Personalization in Video Generation Paper • 2501.06187 • Published 20 days ago • 13
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Paper • 2501.09732 • Published 14 days ago • 66
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature Paper • 2501.07171 • Published 18 days ago • 49