Chat With Janus-Pro-7B: A unified multimodal understanding and generation model.
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Paper • 2501.12224 • Published 9 days ago • 46
Go-with-the-Flow: Motion-Controllable Video Diffusion Models Using Real-Time Warped Noise Paper • 2501.08331 • Published 16 days ago • 20
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos Paper • 2501.12375 • Published 9 days ago • 22
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 9 days ago • 47
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments Paper • 2501.10893 • Published 12 days ago • 22
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 16 days ago • 61