Article Welcome PaliGemma 2 – New vision language models by Google By merve and 3 others • Dec 5, 2024 • 159
Supernova Event Dataset: Interpreting Large Language Model's Personality through Critical Event Analysis Paper • 2506.12189 • Published Jun 13 • 5
An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale Paper • 2010.11929 • Published Oct 22, 2020 • 11
timm Top-20 ImageNet-1k Models Collection The 20 best models on ImageNet-1k validation set, all pretrained on datasets larger than ImageNet and fine-tuned on ImageNet-1k. • 17 items • Updated 3 days ago • 11
How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers Paper • 2106.10270 • Published Jun 18, 2021 • 3
The Multimodal Universe: Enabling Large-Scale Machine Learning with 100TB of Astronomical Scientific Data Paper • 2412.02527 • Published Dec 3, 2024 • 12