TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks • Paper • 2412.14161 • Published 4 days ago • 41
Thinking in Space: How Multimodal Large Language Models See, Remember, and Recall Spaces • Paper • 2412.14171 • Published 4 days ago • 19
The Open Source Advantage in Large Language Models (LLMs) • Paper • 2412.12004 • Published 6 days ago • 8