Scaling Language-Centric Omnimodal Representation Learning Paper • 2510.11693 • Published 8 days ago • 94
Muon Outperforms Adam in Tail-End Associative Memory Learning Paper • 2509.26030 • Published 22 days ago • 18
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning Paper • 2507.22607 • Published Jul 30 • 46
Error Analyses of Auto-Regressive Video Diffusion Models: A Unified Framework Paper • 2503.10704 • Published Mar 12 • 5
RegMix: Data Mixture as Regression for Language Model Pre-training Paper • 2407.01492 • Published Jul 1, 2024 • 40
Octopus: Embodied Vision-Language Programmer from Environmental Feedback Paper • 2310.08588 • Published Oct 12, 2023 • 38