InternVL2.0 Collection Expanding Performance Boundaries of Open-Source MLLM β’ 15 items β’ Updated 14 days ago β’ 89
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 β’ 10 items β’ Updated Dec 13, 2024 β’ 50
OFA-Sys/chinese-clip-vit-large-patch14-336px Zero-Shot Image Classification β’ Updated Dec 9, 2022 β’ 554 β’ 23
google/siglip-base-patch16-224 Zero-Shot Image Classification β’ Updated Sep 26, 2024 β’ 238k β’ 34
openai/clip-vit-base-patch32 Zero-Shot Image Classification β’ Updated Feb 29, 2024 β’ 14.5M β’ β’ 601
Apollo: An Exploration of Video Understanding in Large Multimodal Models Paper β’ 2412.10360 β’ Published Dec 13, 2024 β’ 139
google/siglip-so400m-patch14-384 Zero-Shot Image Classification β’ Updated Sep 26, 2024 β’ 3.2M β’ 444