12 45 80

Weiyun Wang

Weiyun1025

Weiyun1025

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

commented a paper 4 days ago

VisionArena: 230K Real World User-VLM Conversations with Preference Labels

upvoted a paper 4 days ago

VisionArena: 230K Real World User-VLM Conversations with Preference Labels

View all activity

Organizations

Weiyun1025's activity

upvoted a paper 1 day ago

SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding

Paper • 2412.09604 • Published 5 days ago • 35

commented a paper 4 days ago

VisionArena: 230K Real World User-VLM Conversations with Preference Labels

Paper • 2412.08687 • Published 6 days ago • 11 •

upvoted a paper 4 days ago

VisionArena: 230K Real World User-VLM Conversations with Preference Labels

Paper • 2412.08687 • Published 6 days ago • 11

updated a dataset 4 days ago

OpenGVLab/V2PE-Data

Preview • Updated 4 days ago • 28 • 3

New activity in OpenGVLab/V2PE 5 days ago

Create README.md

#1 opened 5 days ago by

dreamerlin

updated a collection 5 days ago

V2PE

Collection

Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding • 3 items • Updated 5 days ago • 1

New activity in OpenGVLab/V2PE-Data 5 days ago

Update README.md

#2 opened 5 days ago by

dreamerlin

Create README.md

#1 opened 5 days ago by

dreamerlin

upvoted a paper 5 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published 6 days ago • 82

updated a collection 6 days ago

V2PE

Collection

Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding • 3 items • Updated 5 days ago • 1

updated a model 6 days ago

OpenGVLab/V2PE

Updated 5 days ago • 1

updated a collection 6 days ago

MPO

Collection

Enhancing the Reasoning Ability of Multimodal Large Language Models via Mixed Preference Optimization • 3 items • Updated 5 days ago • 1

upvoted 3 papers 7 days ago

authored a paper 9 days ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published 11 days ago • 110

liked 3 models 9 days ago

OpenGVLab/InternViT-6B-448px-V2_5

Image Feature Extraction • Updated 9 days ago • 247 • 16

OpenGVLab/InternViT-300M-448px-V2_5

Image Feature Extraction • Updated 9 days ago • 6.49k • 10

OpenGVLab/InternVL2_5-1B

Image-Text-to-Text • Updated 9 days ago • 4.16k • 28