Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 22 items • Updated 13 days ago • 98
GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization Paper • 2601.05242 • Published 24 days ago • 212
MC#: Mixture Compressor for Mixture-of-Experts Large Models Paper • 2510.10962 • Published Oct 13, 2025
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published Dec 23, 2025 • 50
Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models Paper • 2512.20557 • Published Dec 23, 2025 • 50
DC-AR: Efficient Masked Autoregressive Image Generation with Deep Compression Hybrid Tokenizer Paper • 2507.04947 • Published Jul 7, 2025 • 1
Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience Paper • 2512.17260 • Published Dec 19, 2025 • 51
FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos Paper • 2512.10927 • Published Dec 11, 2025 • 6
CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Paper • 2504.13161 • Published Apr 17, 2025 • 93
NaVILA: Legged Robot Vision-Language-Action Model for Navigation Paper • 2412.04453 • Published Dec 5, 2024
EgoVLA: Learning Vision-Language-Action Models from Egocentric Human Videos Paper • 2507.12440 • Published Jul 16, 2025