AIMv2 A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. Collection by apple 1 day ago 45 apple/aimv2-large-patch14-224 Image Feature Extraction • Updated 1 day ago • 201 • 16 apple/aimv2-huge-patch14-224 Image Feature Extraction • Updated 1 day ago • 30 • 4 apple/aimv2-1B-patch14-224 Image Feature Extraction • Updated 1 day ago • 27 • 3 apple/aimv2-3B-patch14-224 Image Feature Extraction • Updated 1 day ago • 15 • 1
Qwen2.5-Coder Code-specific model series based on Qwen2.5 Collection by Qwen 6 days ago 229 Running 949 🐢 Qwen2.5 Coder Artifacts Running 280 👁 Qwen2.5 Coder Demo Qwen/Qwen2.5-Coder-32B-Instruct Text Generation • Updated 6 days ago • 68.4k • • 956 Qwen/Qwen2.5-Coder-32B Text Generation • Updated 6 days ago • 4.89k • 70
Qwen2.5 Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. Collection by Qwen Sep 18 382 Running 521 🚀 Qwen2.5 Qwen/Qwen2.5-0.5B Text Generation • Updated Sep 25 • 131k • 113 Qwen/Qwen2.5-0.5B-Instruct Text Generation • Updated Sep 25 • 586k • 105 Qwen/Qwen2.5-1.5B Text Generation • Updated Oct 8 • 124k • 42
Llama 3.2 This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 Collection by meta-llama about 1 month ago 490 meta-llama/Llama-3.2-1B Text Generation • Updated about 1 month ago • 1.52M • • 1k meta-llama/Llama-3.2-3B Text Generation • Updated about 1 month ago • 348k • 334 meta-llama/Llama-3.2-1B-Instruct Text Generation • Updated about 1 month ago • 1.67M • • 567 meta-llama/Llama-3.2-3B-Instruct Text Generation • Updated about 1 month ago • 970k • • 663
OpenScholar_V1 The set of models, index, data associated with the paper "OpenScholar: Synthesizing Scientific Literature with Retrieval-Augmented LMs". Collection by OpenScholar 2 days ago 21 OpenScholar/Llama-3.1_OpenScholar-8B Updated 5 days ago • 110 • 34 OpenScholar/OpenScholar_Retriever Updated 5 days ago • 22 • 1 OpenScholar/OpenScholar_Reranker Updated 5 days ago • 17 • 2 OpenScholar/OS_Train_Data Viewer • Updated 6 days ago • 130k • 20 • 3
Tulu 3 Datasets All datasets released with Tulu 3 -- state of the art open post-training recipes. Collection by allenai 2 days ago 18 allenai/tulu-3-sft-mixture Viewer • Updated 2 days ago • 939k • 289 • 23 allenai/llama-3.1-tulu-3-8b-preference-mixture Preview • Updated 2 days ago • 15 • 3 allenai/llama-3.1-tulu-3-70b-preference-mixture Viewer • Updated 2 days ago • 334k • 45 • 6 allenai/tulu-3-sft-personas-math Viewer • Updated 2 days ago • 150k • 18 • 1
Tulu 3 Models All models released with Tulu 3 -- state of the art open post-training recipes. Collection by allenai about 10 hours ago 17 allenai/Llama-3.1-Tulu-3-8B Text Generation • Updated 2 days ago • 3.85k • 41 allenai/Llama-3.1-Tulu-3-70B Text Generation • Updated 2 days ago • 278 • 25 allenai/Llama-3.1-Tulu-3-70B-DPO Text Generation • Updated 2 days ago • 264 • 3 allenai/Llama-3.1-Tulu-3-8B-DPO Text Generation • Updated 2 days ago • 341 • 11
Models for dataset curation Collection by Dataset-Tools 1 day ago 16 HuggingFaceFW/fineweb-edu-classifier Text Classification • Updated 7 days ago • 151k • 133 minishlab/potion-base-8M Updated 24 days ago • 11k • 14 nvidia/domain-classifier Updated Jun 24 • 58.4k • 56 nvidia/quality-classifier-deberta Updated Aug 6 • 3.32k • 48
Sahabat-AI A collection of open-source Large Language Model (LLM) for Bahasa Indonesia and the country’s regional languages. Collection by GoToCompany 11 days ago 27 GoToCompany/gemma2-9b-cpt-sahabatai-v1-base Updated 18 days ago • 355 • 15 GoToCompany/gemma2-9b-cpt-sahabatai-v1-instruct Updated 18 days ago • 1.48k • 22 GoToCompany/llama3-8b-cpt-sahabatai-v1-instruct Updated 18 days ago • 1.23k • 4 GoToCompany/llama3-8b-cpt-sahabatai-v1-base Updated 18 days ago • 95
UltraVox Audio Language Model Release 🔊 Collection by reach-vb 9 days ago 15 fixie-ai/ultravox-v0_4_1-llama-3_1-8b Feature Extraction • Updated 9 days ago • 1.72k • 61 fixie-ai/ultravox-v0_4_1-llama-3_1-70b Feature Extraction • Updated 5 days ago • 472 • 21 fixie-ai/ultravox-v0_4_1-mistral-nemo Feature Extraction • Updated 5 days ago • 871 • 16