arxiv:2601.11518
Jonathan Roberts PRO
jonathan-roberts1
AI & ML interests
VLMs, LLMs, LMMs
Recent Activity
upvoted
a
paper
2 days ago
Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games
authored
a paper
3 days ago
Humanity's Last Exam
authored
a paper
3 days ago
Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games