Running on CPU Upgrade Featured 2.95k The Smol Training Playbook 📚 2.95k The secrets to building world-class LLMs
MixCPT Collection Rethinking Multilingual Continual Pretraining: Data Mixing for Adapting LLMs Across Languages and Resources • 41 items • Updated Oct 21, 2025 • 1
Test-Time Scaling of Reasoning Models for Machine Translation Paper • 2510.06471 • Published Oct 7, 2025 • 1
Test-Time Scaling of Reasoning Models for Machine Translation Paper • 2510.06471 • Published Oct 7, 2025 • 1 • 2
EMMA-500: Enhancing Massively Multilingual Adaptation of Large Language Models Paper • 2409.17892 • Published Sep 26, 2024 • 2
GlotEval: A Test Suite for Massively Multilingual Evaluation of Large Language Models Paper • 2504.04155 • Published Apr 5, 2025 • 1