DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning Paper • 2406.11896 • Published Jun 14, 2024 • 19
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning Paper • 2405.10292 • Published May 16, 2024 • 1
A Corpus for Reasoning About Natural Language Grounded in Photographs Paper • 1811.00491 • Published Nov 1, 2018
Autonomous Evaluation and Refinement of Digital Agents Paper • 2404.06474 • Published Apr 9, 2024 • 2
Grounding Language in Multi-Perspective Referential Communication Paper • 2410.03959 • Published Oct 4, 2024 • 4
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published Dec 30, 2024 • 19
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training Paper • 2306.01693 • Published Jun 2, 2023 • 3
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling Paper • 2301.12050 • Published Jan 28, 2023
Quantifying Language Models' Sensitivity to Spurious Features in Prompt Design or: How I learned to start worrying about prompt formatting Paper • 2310.11324 • Published Oct 17, 2023 • 1
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations Paper • 2311.08469 • Published Nov 14, 2023 • 10
Continual Learning for Instruction Following from Realtime Feedback Paper • 2212.09710 • Published Dec 19, 2022