RLEF: Grounding Code LLMs in Execution Feedback with Reinforcement Learning Paper • 2410.02089 • Published Oct 2 • 12
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published 29 days ago • 46
OS-ATLAS: A Foundation Action Model for Generalist GUI Agents Paper • 2410.23218 • Published 29 days ago • 46
What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective Paper • 2410.23743 • Published 28 days ago • 59
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant Paper • 2410.18603 • Published Oct 24 • 30
Precise and Dexterous Robotic Manipulation via Human-in-the-Loop Reinforcement Learning Paper • 2410.21845 • Published about 1 month ago • 11
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant Paper • 2410.18603 • Published Oct 24 • 30
AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant Paper • 2410.18603 • Published Oct 24 • 30 • 2
Qwen2-VL Collection Vision-language model series based on Qwen2 • 15 items • Updated about 8 hours ago • 159
Neural Amortized Inference for Nested Multi-agent Reasoning Paper • 2308.11071 • Published Aug 21, 2023 • 3
TransCoder: Towards Unified Transferable Code Representation Learning Inspired by Human Skills Paper • 2306.07285 • Published May 23, 2023 • 2
Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models Paper • 2405.12939 • Published May 21 • 1
Interactive Evolution: A Neural-Symbolic Self-Training Framework For Large Language Models Paper • 2406.11736 • Published Jun 17 • 5