Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents Paper • 2509.26354 • Published 17 days ago • 17
Conditional Advantage Estimation for Reinforcement Learning in Large Reasoning Models Paper • 2509.23962 • Published 19 days ago • 5
Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Step Paper • 2509.23924 • Published 19 days ago • 7
DADM: Dual Alignment of Domain and Modality for Face Anti-spoofing Paper • 2503.00429 • Published Mar 1 • 1
Kronecker Mask and Interpretive Prompts are Language-Action Video Learners Paper • 2502.03549 • Published Feb 5 • 1
Generalized Face Anti-spoofing via Finer Domain Partition and Disentangling Liveness-irrelevant Factors Paper • 2407.08243 • Published Jul 11, 2024 • 1
G$^2$V$^2$former: Graph Guided Video Vision Transformer for Face Anti-Spoofing Paper • 2408.07675 • Published Aug 14, 2024 • 1
RiOSWorld: Benchmarking the Risk of Multimodal Compter-Use Agents Paper • 2506.00618 • Published May 31 • 1