World-Language-Action Model for Unified World Modeling, Language Reasoning, and Action Synthesis • Paper • 2606.05979 • Published 3 days ago • 5
Flash-WAM: Modality-Aware Distillation for World Action Models • Paper • 2606.05254 • Published 4 days ago • 4
Discrete-WAM: Unified Discrete Vision-Action Token Editing for World-Policy Learning • Paper • 2606.05645 • Published 3 days ago • 1
MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery • Paper • 2606.06473 • Published 3 days ago • 4
Representation Forcing for Bottleneck-Free Unified Multimodal Models • Paper • 2605.31604 • Published 9 days ago • 57
AutoLab: Can Frontier Models Solve Long-Horizon Auto Research and Engineering Tasks? • Paper • 2606.05080 • Published 4 days ago • 27
MeshWeaver: Sparse-Voxel-Guided Surface Weaving for Autoregressive Mesh Generation • Paper • 2606.04688 • Published 4 days ago • 3
GRAIL: Generating Humanoid Loco-Manipulation from 3D Assets and Video Priors • Paper • 2606.05160 • Published 4 days ago • 7
NVIDIA OmniDreams: Real-Time Generative World Model for Closed-Loop Autonomous Vehicle Simulation • Paper • 2606.03159 • Published 5 days ago • 21
Language Models Need Sleep: Learning to Self-Modify and Consolidate Memories • Paper • 2606.03979 • Published 5 days ago • 24
Humanoid-GPT: Scaling Data and Structure for Zero-Shot Motion Tracking • Paper • 2606.03985 • Published 5 days ago • 38
3DCodeBench: Benchmarking Agentic Procedural 3D Modeling via Code • Paper • 2606.01057 • Published 7 days ago • 7
Thinking in Blender: Staged Executable Inverse Graphics with Vision-Language Models • Paper • 2606.02580 • Published 6 days ago • 2
StressDream: Steering Video World Models for Robust Policy Evaluation and Improvement • Paper • 2606.00267 • Published 9 days ago • 2
Linear Scaling Video VLMs for Long Video Understanding • Paper • 2605.31598 • Published 9 days ago • 11
Light Interaction: Training-Free Inference Acceleration for Interactive Video World Models • Paper • 2605.31158 • Published 9 days ago • 2