CineScale: Free Lunch in High-Resolution Cinematic Visual Generation Paper • 2508.15774 • Published 14 days ago • 19
S^2-Guidance: Stochastic Self Guidance for Training-Free Enhancement of Diffusion Models Paper • 2508.12880 • Published 17 days ago • 45
Story2Board: A Training-Free Approach for Expressive Storyboard Generation Paper • 2508.09983 • Published 22 days ago • 67
Stand-In: A Lightweight and Plug-and-Play Identity Control for Video Generation Paper • 2508.07901 • Published 24 days ago • 39
UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation Paper • 2508.05399 • Published 28 days ago • 16
Follow-Your-Shape: Shape-Aware Image Editing via Trajectory-Guided Region Control Paper • 2508.08134 • Published 24 days ago • 9
LongVie: Multimodal-Guided Controllable Ultra-Long Video Generation Paper • 2508.03694 • Published about 1 month ago • 50
DreamVVT: Mastering Realistic Video Virtual Try-On in the Wild via a Stage-Wise Diffusion Transformer Framework Paper • 2508.02807 • Published Aug 4 • 13
CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning Paper • 2507.14111 • Published Jul 18 • 23
π^3: Scalable Permutation-Equivariant Visual Geometry Learning Paper • 2507.13347 • Published Jul 17 • 64
TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation Paper • 2507.04984 • Published Jul 7 • 5
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers Paper • 2507.12956 • Published Jul 17 • 23
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second Paper • 2507.10065 • Published Jul 14 • 24
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit Training Paper • 2505.11594 • Published May 16 • 76