Bigger, Better, Faster: Human-level Atari with human-level efficiency Paper • 2305.19452 • Published May 30, 2023 • 5
Grounding Computer Use Agents on Human Demonstrations Paper • 2511.07332 • Published Nov 10, 2025 • 106
Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training Paper • 2503.18929 • Published Mar 24, 2025 • 4