arxiv:2501.07783
Zhaokai Wang
wzk1015
AI & ML interests
Computer Vision
Music Generation
Multimodal Large Language Models
Recent Activity
updated a model 1 day ago
OpenGVLab/PIIP
liked a model 1 day ago
OpenGVLab/V2PE
authored a paper 2 days ago
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding
Organizations
Papers
11
Models
None public yet
Datasets
None public yet