Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
dongguanting 's Collections
ARPO
Tool-Star
RAG-Critic

Tool-Star

updated 4 days ago

Tool-Star is a reinforcement learning-based framework designed to empower LLMs to autonomously invoke multiple external tools during stepwise reasonin

Upvote
5

  • Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning

    Paper • 2505.16410 • Published May 22 • 57

  • dongguanting/Tool-Star-SFT-54K

    Viewer • Updated May 29 • 54k • 269 • 8

  • dongguanting/Multi-Tool-RL-10K

    Viewer • Updated May 25 • 10k • 134 • 4

  • dongguanting/Tool-Star-Qwen-7B

    Text Generation • 8B • Updated Jun 30 • 20 • 2

  • dongguanting/Tool-Star-Qwen-3B

    Text Generation • 3B • Updated May 25 • 19 • 5

  • dongguanting/Tool-Star-Qwen-1.5B

    Text Generation • 2B • Updated Jun 6 • 24 • 2

  • dongguanting/Tool-Star-Qwen-0.5B

    Text Generation • 0.6B • Updated Jun 6 • 10 • 1
Upvote
5
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs