
Sihan XU

sihanxu
https://sihanxu.github.io/
  • 6SihanXu
  • SihanXU

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago
SixAILab/dit-2-b-400k
published a model 5 days ago
SixAILab/dit-2-b-400k
upvoted an article 13 days ago
NEO-unify: Building Native Multimodal Unified Models End to End

Organizations

  • University of Michigan
  • Situated Language and Embodied Dialogue Lab
  • SixAILab
  • 2Infinity Lab
  • Forty-Two AI Lab

authored 3 papers 3 months ago

Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation

Paper • 2504.16060 • Published Apr 22, 2025

4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time

Paper • 2506.18890 • Published Jun 23, 2025 • 6

Next-Embedding Prediction Makes Strong Vision Learners

Paper • 2512.16922 • Published Dec 18, 2025 • 88
authored a paper over 1 year ago

Multi-Object Hallucination in Vision-Language Models

Paper • 2407.06192 • Published Jul 8, 2024 • 12
authored 2 papers over 2 years ago

CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation

Paper • 2310.13165 • Published Oct 19, 2023

Inversion-Free Image Editing with Natural Language

Paper • 2312.04965 • Published Dec 7, 2023 • 2