Collections
Discover the best community collections!
Collections including paper arxiv:2311.06242
-
Kosmos-2.5: A Multimodal Literate Model
Paper • 2309.11419 • Published • 50 -
Mirasol3B: A Multimodal Autoregressive model for time-aligned and contextual modalities
Paper • 2311.05698 • Published • 9 -
Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks
Paper • 2311.06242 • Published • 84 -
PolyMaX: General Dense Prediction with Mask Transformer
Paper • 2311.05770 • Published • 6