Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Paper • 2409.12191 • Published Sep 18 • 75
OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework Paper • 2202.03052 • Published Feb 7, 2022
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities Paper • 2305.11172 • Published May 18, 2023 • 1
Sketch and Refine: Towards Faithful and Informative Table-to-Text Generation Paper • 2105.14778 • Published May 31, 2021