Mean Mode Screaming: Mean–Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 19 days ago • 229
Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles Paper • 2605.22177 • Published 5 days ago • 18
Video2GUI: Synthesizing Large-Scale Interaction Trajectories for Generalized GUI Agent Pretraining Paper • 2605.14747 • Published 12 days ago • 143
DiffRetriever: Parallel Representative Tokens for Retrieval with Diffusion Language Models Paper • 2605.07210 • Published 18 days ago • 4
CGM-JEPA: Learning Consistent Continuous Glucose Monitor Representations via Predictive Self-Supervised Pretraining Paper • 2605.00933 • Published 25 days ago • 2
Length Value Model: Scalable Value Pretraining for Token-Level Length Modeling Paper • 2604.27039 • Published 27 days ago • 24
Leveraging Verifier-Based Reinforcement Learning in Image Editing Paper • 2604.27505 • Published 26 days ago • 57
Heterogeneous Scientific Foundation Model Collaboration Paper • 2604.27351 • Published 26 days ago • 218
Modeling Sparse and Bursty Vulnerability Sightings: Forecasting Under Data Constraints Paper • 2604.16038 • Published Apr 17 • 3
GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning Paper • 2604.02721 • Published Apr 3 • 630