In-the-Flow Agentic System Optimization for Effective Planning and Tool Use Paper • 2510.05592 • Published 21 days ago • 94
Where LLM Agents Fail and How They can Learn From Failures Paper • 2509.25370 • Published 28 days ago • 11 • 2
SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents Paper • 2505.23559 • Published May 29 • 11 • 2
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 300
MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents Paper • 2503.01935 • Published Mar 3 • 29 • 3
EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Paper • 2502.09560 • Published Feb 13 • 35
Eurus Collection Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Aug 7 • 26
Trelis/Llama-2-7b-chat-hf-function-calling-v2 Text Generation • 7B • Updated Nov 24, 2023 • 945 • 137