AppAgent: Multimodal Agents as Smartphone Users Paper • 2312.13771 • Published Dec 21, 2023 • 54
ARB: Advanced Reasoning Benchmark for Large Language Models Paper • 2307.13692 • Published Jul 25, 2023 • 17