arxiv:2412.13871
Xuesong Yang
magicyoung8
AI & ML interests
TTS/ASR, Generative Models.
Recent Activity
authored
a paper
4 days ago
LLaVA-UHD v2: an MLLM Integrating High-Resolution Feature Pyramid via
Hierarchical Window Transformer
Organizations
Papers
1
models
None public yet
datasets
None public yet