Learning an Image Editing Model without Image Editing Pairs Paper • 2510.14978 • Published 4 days ago • 6
UniVideo: Unified Understanding, Generation, and Editing for Videos Paper • 2510.08377 • Published 11 days ago • 65
Self-Forcing++: Towards Minute-Scale High-Quality Video Generation Paper • 2510.02283 • Published 18 days ago • 89
UniVid: Unifying Vision Tasks with Pre-trained Video Generation Models Paper • 2509.21760 • Published 25 days ago • 14
RecA Collection Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning! • 8 items • Updated 28 days ago • 12
X-UniMotion: Animating Human Images with Expressive, Unified and Identity-Agnostic Motion Latents Paper • 2508.09383 • Published Aug 12 • 1
Lynx: Towards High-Fidelity Personalized Video Generation Paper • 2509.15496 • Published Sep 19 • 12
WorldForge: Unlocking Emergent 3D/4D Generation in Video Diffusion Model via Training-Free Guidance Paper • 2509.15130 • Published Sep 18 • 30
Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation Paper • 2506.04225 • Published Jun 4 • 28
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency Paper • 2508.18265 • Published Aug 25 • 201
FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation Paper • 2508.11255 • Published Aug 15 • 11
Lumos-1: On Autoregressive Video Generation from a Unified Model Perspective Paper • 2507.08801 • Published Jul 11 • 30
LiON-LoRA: Rethinking LoRA Fusion to Unify Controllable Spatial and Temporal Generation for Video Diffusion Paper • 2507.05678 • Published Jul 8 • 1
RealisVSR: Detail-enhanced Diffusion for Real-World 4K Video Super-Resolution Paper • 2507.19138 • Published Jul 25 • 1
∇NABLA: Neighborhood Adaptive Block-Level Attention Paper • 2507.13546 • Published Jul 17 • 123
PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation Paper • 2507.16116 • Published Jul 22 • 10
Upsample What Matters: Region-Adaptive Latent Sampling for Accelerated Diffusion Transformers Paper • 2507.08422 • Published Jul 11 • 36