arxiv:2501.07783
Xizhou Zhu
Einsiedler
AI & ML interests
None yet
Recent Activity
authored
a paper
2 days ago
Parameter-Inverted Image Pyramid Networks for Visual Perception and
Multimodal Understanding
authored
a paper
about 1 month ago
SynerGen-VL: Towards Synergistic Image Understanding and Generation with
Vision Experts and Token Folding
authored
a paper
about 1 month ago
Expanding Performance Boundaries of Open-Source Multimodal Models with
Model, Data, and Test-Time Scaling
Organizations
None yet
models
None public yet
datasets
None public yet