Photometric Inverse Rendering: Shading Cues Modeling and Surface Reflectance Regularization Paper • 2408.06828 • Published Aug 13, 2024
Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models Paper • 2410.10821 • Published Oct 14, 2024 • 1
LumiTex: Towards High-Fidelity PBR Texture Generation with Illumination Context Paper • 2511.19437 • Published Nov 24, 2025
CaliTex: Geometry-Calibrated Attention for View-Coherent 3D Texture Generation Paper • 2511.21309 • Published Nov 26, 2025
Janus: Disaggregating Attention and Experts for Scalable MoE Inference Paper • 2512.13525 • Published Dec 15, 2025 • 6
Presenting a Paper is an Art: Self-Improvement Aesthetic Agents for Academic Presentations Paper • 2510.05571 • Published Oct 7, 2025 • 15
Elucidating The Design Space of Classifier-Guided Diffusion Generation Paper • 2310.11311 • Published Oct 17, 2023
Explore and Exploit the Diverse Knowledge in Model Zoo for Domain Generalization Paper • 2306.02595 • Published Jun 5, 2023
On the Expressive Power of a Variant of the Looped Transformer Paper • 2402.13572 • Published Feb 21, 2024
Towards Understanding How Transformer Perform Multi-step Reasoning with Matching Operation Paper • 2405.15302 • Published May 24, 2024
Elucidating the design space of language models for image generation Paper • 2410.16257 • Published Oct 21, 2024
Rewards Are Enough for Fast Photo-Realistic Text-to-image Generation Paper • 2503.13070 • Published Mar 17, 2025 • 10
Learning Few-Step Diffusion Models by Trajectory Distribution Matching Paper • 2503.06674 • Published Mar 9, 2025 • 8
TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets Paper • 2502.01506 • Published Feb 3, 2025 • 39
UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models Paper • 2410.14059 • Published Oct 17, 2024 • 63
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications Paper • 2408.11878 • Published Aug 20, 2024 • 64
HuatuoGPT-Vision, Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale Paper • 2406.19280 • Published Jun 27, 2024 • 63
VDC: Versatile Data Cleanser for Detecting Dirty Samples via Visual-Linguistic Inconsistency Paper • 2309.16211 • Published Sep 28, 2023