Jonathan Roberts's picture

Jonathan Roberts PRO

jonathan-roberts1

·

AI & ML interests

VLMs, LLMs, LMMs

Recent Activity

upvoted a paper 2 days ago

Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games

authored a paper 3 days ago

Humanity's Last Exam

authored a paper 3 days ago

Beyond Outcomes: Transparent Assessment of LLM Reasoning in Games

View all activity

Organizations

Papers 10

arxiv:2601.11518

arxiv:2502.09696

arxiv:2501.14249

arxiv:2412.13602

models 0

None public yet

datasets 35

jonathan-roberts1/GRAB-lite

Viewer • Updated Dec 24, 2025 • 500 • 22

jonathan-roberts1/GRAB

Viewer • Updated Dec 24, 2025 • 2.17k • 5 • 4

jonathan-roberts1/GRAB-real

Viewer • Updated Dec 24, 2025 • 1.11k • 27

jonathan-roberts1/zerobench

Viewer • Updated Dec 23, 2025 • 434 • 543 • 29

jonathan-roberts1/zerobench_no_answers

Viewer • Updated Jul 30, 2025 • 434 • 5

jonathan-roberts1/SciFIBench

Viewer • Updated Jan 10, 2025 • 2k • 128 • 4

jonathan-roberts1/needle-threading

Viewer • Updated Nov 8, 2024 • 10.9k • 19 • 3

jonathan-roberts1/SATIN

Updated May 14, 2024 • 59 • 8

jonathan-roberts1/EuroSAT

Viewer • Updated Jan 8, 2024 • 27k • 278 • 3

jonathan-roberts1/AID_MultiLabel

Viewer • Updated Apr 3, 2023 • 3k • 20

View 35 datasets