RefalMachine/ruadapt_qwen2.5_3B_ext_u48_instruct_v4 Text Generation • Updated Dec 31, 2024 • 529 • 28
IlyaGusev/saiga_llama3_70b_sft_m1_d5_abliterated_kto_m1_d2_awq_4bit Text Generation • Updated Jun 18, 2024 • 4 • 2
PingPong: A Benchmark for Role-Playing Language Models with User Emulation and Multi-Model Evaluation Paper • 2409.06820 • Published Sep 10, 2024 • 64
Minitron Collection A family of compressed models obtained via pruning and knowledge distillation • 12 items • Updated 20 days ago • 60
🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Dec 22, 2024 • 213
Vikhr: The Family of Open-Source Instruction-Tuned Large Language Models for Russian Paper • 2405.13929 • Published May 22, 2024 • 54