zooblastlbz's Collections
reason

updated Apr 11

  • rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

    Paper • 2501.04519 • Published Jan 8 • 284

  • Evolving Deeper LLM Thinking

    Paper • 2501.09891 • Published Jan 17 • 116

  • START: Self-taught Reasoner with Tools

    Paper • 2503.04625 • Published Mar 6 • 114

  • SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild

    Paper • 2503.18892 • Published Mar 24 • 32

  • I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

    Paper • 2503.18878 • Published Mar 24 • 121

  • JudgeLRM: Large Reasoning Models as a Judge

    Paper • 2504.00050 • Published Mar 31 • 62

  • Hogwild! Inference: Parallel LLM Generation via Concurrent Attention

    Paper • 2504.06261 • Published Apr 8 • 111