The End of Manual Decoding: Towards Truly End-to-End Language Models Paper • 2510.26697 • Published 8 days ago • 113
Scaling Latent Reasoning via Looped Language Models Paper • 2510.25741 • Published 8 days ago • 201
Ming-Flash-Omni: A Sparse, Unified Architecture for Multimodal Perception and Generation Paper • 2510.24821 • Published 10 days ago • 34
VFXMaster: Unlocking Dynamic Visual Effect Generation via In-Context Learning Paper • 2510.25772 • Published 8 days ago • 32
Running on Zero 56 56 Qwen Image Edit Next Scene 🎥 Fast 4 step inference with Qwen Image Edit 2509
Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence Paper • 2510.20579 • Published 15 days ago • 54
Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing Paper • 2510.19808 • Published 15 days ago • 28