SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents Paper • 2602.12984 • Published 3 days ago • 2
ABot-M0: VLA Foundation Model for Robotic Manipulation with Action Manifold Learning Paper • 2602.11236 • Published 5 days ago • 8
OneVision-Encoder: Codec-Aligned Sparsity as a Foundational Principle for Multimodal Intelligence Paper • 2602.08683 • Published 7 days ago • 31
Light4D: Training-Free Extreme Viewpoint 4D Video Relighting Paper • 2602.11769 • Published 4 days ago • 1
Code2Worlds: Empowering Coding LLMs for 4D World Generation Paper • 2602.11757 • Published 4 days ago • 1
GeneralVLA: Generalizable Vision-Language-Action Models with Knowledge-Guided Trajectory Planning Paper • 2602.04315 • Published 12 days ago • 1
On Robustness and Chain-of-Thought Consistency of RL-Finetuned VLMs Paper • 2602.12506 • Published 3 days ago • 3
Towards Universal Video MLLMs with Attribute-Structured and Quality-Verified Instructions Paper • 2602.13013 • Published 3 days ago • 2
Xiaomi-Robotics-0: An Open-Sourced Vision-Language-Action Model with Real-Time Execution Paper • 2602.12684 • Published 3 days ago • 1
FLAC: Maximum Entropy RL via Kinetic Energy Regularized Bridge Matching Paper • 2602.12829 • Published 3 days ago • 2
MedXIAOHE: A Comprehensive Recipe for Building Medical MLLMs Paper • 2602.12705 • Published 3 days ago • 38
Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception Paper • 2602.11858 • Published 4 days ago • 47
T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization Paper • 2602.12262 • Published 4 days ago • 8
EgoHumanoid: Unlocking In-the-Wild Loco-Manipulation with Robot-Free Egocentric Demonstration Paper • 2602.10106 • Published 6 days ago • 20
χ_{0}: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies Paper • 2602.09021 • Published 7 days ago • 25
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models Paper • 2602.12036 • Published 4 days ago • 89
The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies Paper • 2602.09877 • Published 6 days ago • 185
Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation Paper • 2602.12125 • Published 4 days ago • 56