arxiv:2601.11044
Keyu Li (SII)
weizhihao1
AI & ML interests
Large Language Model Agent, Multi-Agent System
Recent Activity
Updated a dataset 2 days ago: GAIR/AgencyBench
Commented on a paper 5 days ago: AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
Published a dataset 6 days ago: GAIR/AgencyBench