view article Article Training Design for Text-to-Image Models: Lessons from Ablations 2 days ago • 44
FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality Paper • 2410.19355 • Published Oct 25, 2024 • 24
Diversity-Preserved Distribution Matching Distillation for Fast Visual Synthesis Paper • 2602.03139 • Published 3 days ago • 36
3D-Aware Implicit Motion Control for View-Adaptive Human Video Generation Paper • 2602.03796 • Published 2 days ago • 48
Green-VLA: Staged Vision-Language-Action Model for Generalist Robots Paper • 2602.00919 • Published 5 days ago • 199
PISCES: Annotation-free Text-to-Video Post-Training via Optimal Transport-Aligned Rewards Paper • 2602.01624 • Published 4 days ago • 23
M-ErasureBench: A Comprehensive Multimodal Evaluation Benchmark for Concept Erasure in Diffusion Models Paper • 2512.22877 • Published Dec 28, 2025 • 2
JUST-DUB-IT: Video Dubbing via Joint Audio-Visual Diffusion Paper • 2601.22143 • Published 7 days ago • 6
UPLiFT: Efficient Pixel-Dense Feature Upsampling with Local Attenders Paper • 2601.17950 • Published 11 days ago • 4
SALAD: Achieve High-Sparsity Attention via Efficient Linear Attention Tuning for Video Diffusion Transformer Paper • 2601.16515 • Published 14 days ago • 15
VideoMaMa: Mask-Guided Video Matting via Generative Prior Paper • 2601.14255 • Published 16 days ago • 15
HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding Paper • 2601.14724 • Published 16 days ago • 74
Transition Matching Distillation for Fast Video Generation Paper • 2601.09881 • Published 22 days ago • 32
Alterbute: Editing Intrinsic Attributes of Objects in Images Paper • 2601.10714 • Published 21 days ago • 30
SnapGen++: Unleashing Diffusion Transformers for Efficient High-Fidelity Image Generation on Edge Devices Paper • 2601.08303 • Published 24 days ago • 16
MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head Paper • 2601.07832 • Published 24 days ago • 51
GenCtrl -- A Formal Controllability Toolkit for Generative Models Paper • 2601.05637 • Published 28 days ago • 4