SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M ā¢ 10 items ā¢ Updated 3 days ago ā¢ 176
view article Article š®š¹šÆšµš§š· Generating multilingual instruction datasets with Magpie š¦āā¬ By anakin87 ā¢ Oct 21 ā¢ 18
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models Paper ā¢ 2405.01535 ā¢ Published May 2 ā¢ 118
Phi-3 Collection Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. ā¢ 26 items ā¢ Updated 10 days ago ā¢ 498
view article Article š¤ PEFT: Parameter-Efficient Fine-Tuning of Billion-Scale Models on Low-Resource Hardware Feb 10, 2023 ā¢ 37
Llama-3.1-Nemotron-70B Collection SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. ā¢ 6 items ā¢ Updated Oct 15 ā¢ 141
Qwen2-VL Collection Vision-language model series based on Qwen2 ā¢ 15 items ā¢ Updated Sep 18 ā¢ 157
Qwen2-Audio Collection Audio-language model series based on Qwen2 ā¢ 4 items ā¢ Updated Sep 18 ā¢ 45
Nemotron 3 8B Collection The Nemotron 3 8B Family of models is optimized for building production-ready generative AI applications for the enterprise. ā¢ 5 items ā¢ Updated Oct 1 ā¢ 46
view article Article Multimodal Augmentation for Documents: Recovering āComprehensionā in āReading and Comprehensionā task By danaaubakirova ā¢ May 16 ā¢ 17
DonaciĆ³n Somos600M Collection ColecciĆ³n de los corpus donados para el Hackathon de SomosNLP 2024: #somos600M ā¢ 4 items ā¢ Updated Mar 9 ā¢ 2
Awesome SFT datasets Collection A curated list of interesting datasets to fine-tune language models with. ā¢ 43 items ā¢ Updated Apr 12 ā¢ 119
š¤ TinyLlama Alignment Collection TinyLlama-1.1B model aligned on Intel's Orca dataset. Comparison of DPO/IPO/KTO. ā¢ 3 items ā¢ Updated Mar 22 ā¢ 1
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws Paper ā¢ 2401.00448 ā¢ Published Dec 31, 2023 ā¢ 28