CatLIP: CLIP-level Visual Recognition Accuracy with 2.7x Faster Pre-training on Web-scale Image-Text Data Paper β’ 2404.15653 β’ Published Apr 24, 2024 β’ 27