FiffDepth: Feed-forward Transformation of Diffusion-Based Generators for Detailed Depth Estimation • Paper • 2412.00671 • Published Dec 1, 2024
Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis • Paper • 2412.15322 • Published 22 days ago
DepthMaster: Taming Diffusion Models for Monocular Depth Estimation • Paper • 2501.02576 • Published 5 days ago