Tool Use - a doing Collection

doing 's Collections

Tool Use

updated 2 days ago

ToolHop: A Query-Driven Benchmark for Evaluating Large Language Models in Multi-Hop Tool Use

Paper • 2501.02506 • Published 5 days ago • 7