4 13 3

KunlunZhu

AI & ML interests

None yet

Recent Activity

upvoted a paper 19 days ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

upvoted a paper 26 days ago

Where LLM Agents Fail and How They can Learn From Failures

commented on a paper 26 days ago

Where LLM Agents Fail and How They can Learn From Failures

View all activity

Organizations

upvoted a paper 19 days ago

In-the-Flow Agentic System Optimization for Effective Planning and Tool Use

Paper • 2510.05592 • Published 20 days ago • 92

upvoted a paper 26 days ago

Where LLM Agents Fail and How They can Learn From Failures

Paper • 2509.25370 • Published 28 days ago • 11

upvoted a paper 3 months ago

Sotopia-RL: Reward Design for Social Intelligence

Paper • 2508.03905 • Published Aug 5 • 23

upvoted a paper 5 months ago

SafeScientist: Toward Risk-Aware Scientific Discoveries by LLM Agents

Paper • 2505.23559 • Published May 29 • 11

upvoted a paper 7 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300

upvoted a paper 8 months ago

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Paper • 2503.01935 • Published Mar 3 • 29

upvoted 2 articles 8 months ago

Article

arXiv实用技巧，如何让你的paper关注度变高？

•

Jul 8, 2024

• 14

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.31k

upvoted a paper 8 months ago

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Paper • 2502.09560 • Published Feb 13 • 35

upvoted a paper about 1 year ago

Exploring Format Consistency for Instruction Tuning

Paper • 2307.15504 • Published Jul 28, 2023 • 8

upvoted a collection over 1 year ago

Eurus

Collection

Advancing LLM Reasoning Generalists with Preference Trees • 11 items • Updated Aug 7 • 26

upvoted a paper almost 2 years ago

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 237

upvoted a paper about 2 years ago

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Paper • 2307.16789 • Published Jul 31, 2023 • 101

KunlunZhu

AI & ML interests

Recent Activity

Organizations

KunlunZhu's activity

arXiv实用技巧，如何让你的paper关注度变高？

Open-source DeepResearch – Freeing our search agents