Sihan XU's picture

Sihan XU

sihanxu

·

https://sihanxu.github.io/

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

SixAILab/dit-2-b-400k

published a model 5 days ago

SixAILab/dit-2-b-400k

upvoted an article 13 days ago

NEO-unify: Building Native Multimodal Unified Models End to End

View all activity

Organizations

authored 3 papers 3 months ago

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Paper • 2504.16060 • Published Apr 22, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 88

authored a paper over 1 year ago

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12

authored 2 papers over 2 years ago

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation

Paper • 2310.13165 • Published Oct 19, 2023

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2