The MiniMax-M2 Series: Mini Activations Unleashing Max Real-World Intelligence • Paper • 2605.26494 • Published 1 day ago • 21
Look Before You Leap: Autonomous Exploration for LLM Agents • Paper • 2605.16143 • Published 13 days ago • 9
📊 DNA benchmarks • Collection • Zero-shot DNA benchmarks for Variant Effect prediction, Sequence Recovery and Perturbation tasks. • 5 items • Updated 8 days ago • 9
Laguna XS.2 • Collection • Designed for agentic coding and long-horizon work on a local machine. Apache 2.0. • 5 items • Updated 20 days ago • 21
NVIDIA Nemotron v3 • Collection • Open, Production-ready Enterprise Models • 18 items • Updated 8 days ago • 296
ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads? • Paper • 2602.19594 • Published Feb 23 • 3
Structured Distillation of Web Agent Capabilities Enables Generalization • Paper • 2604.07776 • Published Apr 9 • 23
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models • Paper • 2601.14004 • Published Jan 20 • 48