No Label Left Behind: A Unified Surface Defect Detection Model for all Supervision Regimes Paper • 2508.19060 • Published 10 days ago • 8
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents Paper • 2508.17198 • Published 12 days ago • 6
UI-Level Evaluation of ALLaM 34B: Measuring an Arabic-Centric LLM via HUMAIN Chat Paper • 2508.17378 • Published 12 days ago • 6
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench Paper • 2508.20931 • Published 8 days ago • 15
T2R-bench: A Benchmark for Generating Article-Level Reports from Real World Industrial Tables Paper • 2508.19813 • Published 9 days ago • 20
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning Paper • 2508.21104 • Published 8 days ago • 27
UItron: Foundational GUI Agent with Advanced Perception and Planning Paper • 2508.21767 • Published 7 days ago • 11
TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training Paper • 2508.17677 • Published 11 days ago • 14
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models Paper • 2508.21365 • Published 7 days ago • 21
TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis Paper • 2508.13618 • Published 17 days ago • 17
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper • 2508.21148 • Published 8 days ago • 125
Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation Paper • 2508.20470 • Published 8 days ago • 64
A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code Paper • 2508.18106 • Published 11 days ago • 73
R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning Paper • 2508.21113 • Published 8 days ago • 102
EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control Paper • 2508.21112 • Published 8 days ago • 71
FastMesh:Efficient Artistic Mesh Generation via Component Decoupling Paper • 2508.19188 • Published 10 days ago • 14
Gaze into the Heart: A Multi-View Video Dataset for rPPG and Health Biomarkers Estimation Paper • 2508.17924 • Published 11 days ago • 14