Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Bohan Zhai's picture
3 7 1

Bohan Zhai PRO

Borise
YaYaGeGe's profile picture shijiay's profile picture 21world's profile picture
·

AI & ML interests

LLM, Audio, NLP, 3D vision, vision language

Recent Activity

liked a dataset 1 day ago
Borise/CaptionQA
commented on a paper 3 days ago
CaptionQA: Is Your Caption as Useful as the Image Itself?
upvoted an article 4 days ago
📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important Than You Think
View all activity

Organizations

sfai-temp-reasoning-model's profile picture

Articles 2

Article
3

📌 Rethinking Multimodality from an Industry Perspective: Captioning Is Far More Important Than You Think

Article
20

Key Insights into the Law of Vision Representations in MLLMs

View all Articles

Papers 7

arxiv:2511.21025
arxiv:2503.19988
arxiv:2408.16357
arxiv:2403.01487

models 5

Borise/llava_qwen2_dit_stage2_14B

15B • Updated May 30 • 3

Borise/llava_qwen2_dino224_stage2_14B

15B • Updated May 30 • 3

Borise/llava_qwen2_clip336_stage2_14B

15B • Updated May 30 • 2

Borise/llava_qwen2_clip224_stage2_14B

15B • Updated May 30 • 3

Borise/zephyr-7b-dpo-full

Text Generation • 7B • Updated Oct 3, 2024 • 7

datasets 1

Borise/CaptionQA

Viewer • Updated 6 days ago • 1.31k • 525 • 4
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs