LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training Paper • 2509.23661 • Published 22 days ago • 44
LLaVA-OneVision-1.5 Collection https://github.com/EvolvingLMMs-Lab/LLaVA-OneVision-1.5 • 9 items • Updated 9 days ago • 16
RoboBrain2.0 Collection RoboBrain 2.0: See Better. Think Harder. Do Smarter. • 6 items • Updated Jul 23 • 17
RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics Paper • 2506.04308 • Published Jun 4 • 43