arxiv:2407.16741
Frank Xu
frankxu
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
TheAgentCompany: Benchmarking LLM Agents on Consequential Real World
Tasks
authored
a paper
6 months ago
OpenDevin: An Open Platform for AI Software Developers as Generalist
Agents
updated
a dataset
6 months ago
OpenHands/eval-output-webarena
Organizations
models
2
datasets
None public yet