QLASS: Boosting Language Agent Inference via Q-Guided Stepwise Search Paper • 2502.02584 • Published 5 days ago • 14
Zero-Shot Novel View and Depth Synthesis with Multi-View Geometric Diffusion Paper • 2501.18804 • Published 10 days ago • 5
Junyi42/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt Image-to-3D • Updated Oct 30, 2024 • 7.91k • 16
Junyi42/MonST3R_PO-TA-S-W_ViTLarge_BaseDecoder_512_dpt Image-to-3D • Updated Oct 30, 2024 • 7.91k • 16
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Paper • 2410.03825 • Published Oct 4, 2024 • 19
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Paper • 2410.03825 • Published Oct 4, 2024 • 19 • 3
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Paper • 2410.03825 • Published Oct 4, 2024 • 19
MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion Paper • 2410.03825 • Published Oct 4, 2024 • 19 • 3
CameraCtrl: Enabling Camera Control for Text-to-Video Generation Paper • 2404.02101 • Published Apr 2, 2024 • 22
Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers Paper • 2402.19479 • Published Feb 29, 2024 • 33
stabilityai/stable-video-diffusion-img2vid-xt-1-1 Image-to-Video • Updated Jul 10, 2024 • 104k • 838
LayoutDiffusion: Improving Graphic Layout Generation by Discrete Diffusion Probabilistic Models Paper • 2303.11589 • Published Mar 21, 2023
A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence Paper • 2305.15347 • Published May 24, 2023