rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8, 2025 • 288
💧 LFM2.5 Collection Collection of Instruct, Base, and Japanese LFM2.5-1.2B models. • 19 items • Updated 21 minutes ago • 67
Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling Paper • 2601.02346 • Published 7 days ago • 25
view article Article NVIDIA Cosmos Reason 2 Brings Advanced Reasoning To Physical AI 7 days ago • 42
TimeBill: Time-Budgeted Inference for Large Language Models Paper • 2512.21859 • Published 18 days ago • 24
view article Article nanoVLM: The simplest repository to train your VLM in pure PyTorch +5 May 21, 2025 • 247
Optimal Sparsity Math Collection Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks • 67 items • Updated Aug 19, 2025 • 2
view article Article The Open Evaluation Standard: Benchmarking NVIDIA Nemotron 3 Nano with NeMo Evaluator 26 days ago • 44
view article Article Tokenization in Transformers v5: Simpler, Clearer, and More Modular +4 26 days ago • 111
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 18 items • Updated about 15 hours ago • 47
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning Paper • 2511.19900 • Published Nov 25, 2025 • 48