EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots Paper • 2602.18071 • Published Feb 20 • 22
Generated Reality: Human-centric World Simulation using Interactive Video Generation with Hand and Camera Control Paper • 2602.18422 • Published Feb 20 • 30
Does Your Reasoning Model Implicitly Know When to Stop Thinking? Paper • 2602.08354 • Published Feb 9 • 262
MultiShotMaster: A Controllable Multi-Shot Video Generation Framework Paper • 2512.03041 • Published Dec 2, 2025 • 66
OPUS: Towards Efficient and Principled Data Selection in Large Language Model Pre-training in Every Iteration Paper • 2602.05400 • Published Feb 5 • 349
DreamDojo: A Generalist Robot World Model from Large-Scale Human Videos Paper • 2602.06949 • Published Feb 6 • 36
Running on CPU Upgrade 1.29k Omni Image Editor 🖼 1.29k Image edit, text to image, image upscale, remove watermark
Running on Zero MCP 1.63k Wan2.2 14B Preview 🐌 1.63k generate a video from an image with a text prompt
Running on Zero MCP Featured 1.18k Qwen-Image-Edit-2511-LoRAs-Fast 🎃 1.18k Demo of the Collection of Qwen Image Edit LoRAs
Running on Zero Featured 1.76k Qwen3-TTS Demo 🎙 1.76k Generate speech audio via voice design, cloning, or preset speakers
Running on Zero MCP 2.7k Z Image Turbo 🖼 2.7k Generate high-quality images from text prompts in seconds