Odysseus Collection The models from the paper "Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning", trained to play Super Mario Land • 2 items • Updated 2 days ago
MLS-Bench: A Holistic and Rigorous Assessment of AI Systems on Building Better AI Paper • 2605.08678 • Published 19 days ago • 8
Building Math Agents with Multi-Turn Iterative Preference Learning Paper • 2409.02392 • Published Sep 4, 2024 • 16
Provably Efficient Offline Reinforcement Learning with Perturbed Data Sources Paper • 2306.08364 • Published Jun 14, 2023
Odysseus: Scaling VLMs to 100+ Turn Decision-Making in Games via Reinforcement Learning Paper • 2605.00347 • Published 27 days ago • 16