Kimi-K2 Collection Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 4 items • Updated 5 days ago • 142
Running on CPU Upgrade 2.01k 2.01k The Smol Training Playbook: The Secrets to Building World-Class LLMs 📝 Display loss curves for training LLMs
MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing Paper • 2509.22186 • Published Sep 26 • 130