Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer Paper • 2510.06590 • Published Oct 8 • 73
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published 4 days ago • 41
PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval Paper • 2405.10160 • Published May 16, 2024 • 1
RSCC: A Large-Scale Remote Sensing Change Caption Dataset for Disaster Events Paper • 2509.01907 • Published Sep 2 • 2
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images Paper • 2410.01768 • Published Oct 2, 2024 • 4
PANORAMA: The Rise of Omnidirectional Vision in the Embodied AI Era Paper • 2509.12989 • Published Sep 16 • 28
EarthSynth: Generating Informative Earth Observation with Diffusion Models Paper • 2505.12108 • Published May 17 • 2
Locate Anything on Earth: Advancing Open-Vocabulary Object Detection for Remote Sensing Community Paper • 2408.09110 • Published Aug 17, 2024 • 2