view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 3 days ago β’ 248
π FineMath Collection FineMath datasets and ablation models β’ 14 items β’ Updated 23 days ago β’ 19
PaliGemma 2: A Family of Versatile VLMs for Transfer Paper β’ 2412.03555 β’ Published Dec 4, 2024 β’ 129
PaliGemma 2 Release Collection Vision-Language Models available in multiple 3B, 10B and 28B variants. β’ 32 items β’ Updated 3 days ago β’ 145
LayerSkip Collection Models continually pretrained using LayerSkip - https://arxiv.org/abs/2404.16710 β’ 8 items β’ Updated Nov 21, 2024 β’ 47
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. β’ 27 items β’ Updated 3 days ago β’ 57
view article Article Llama can now see and run on your device - welcome Llama 3.2 Sep 25, 2024 β’ 184
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 β’ 15 items β’ Updated Dec 6, 2024 β’ 576
HelloBench: Evaluating Long Text Generation Capabilities of Large Language Models Paper β’ 2409.16191 β’ Published Sep 24, 2024 β’ 42
VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads Paper β’ 2407.18245 β’ Published Jul 25, 2024 β’ 10
Gemma Scope Release Collection A comprehensive, open suite of sparse autoencoders for Gemma 2 2B and 9B. β’ 10 items β’ Updated 3 days ago β’ 17
LLM Compiler Collection Meta LLM Compiler is a state-of-the-art LLM that builds upon Code Llama with improved performance for code optimization and compiler reasoning. β’ 4 items β’ Updated Jun 27, 2024 β’ 149
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 β’ 193
Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models Paper β’ 2403.18814 β’ Published Mar 27, 2024 β’ 47