LMMs-Lab

community

https://www.lmms-lab.com/

lmmslab

EvolvingLMMs-Lab

AI & ML interests

Feeling and building the multimodal intelligence.

Recent Activity

xiangan new activity about 8 hours ago

lmms-lab/LLaVA-OneVision-1.5-Instruct-Data:⚠️ Missing images in some SFT dataset subsets — please report affected samples

Yin-Xie updated a dataset 2 days ago

lmms-lab/LLaVA-OneVision-1.5-Instruct-Data

xiangan updated a dataset 2 days ago

lmms-lab/LLaVA-NeXT-780k-webdataset

View all activity

xiangan

in lmms-lab/LLaVA-OneVision-1.5-Instruct-Data about 8 hours ago

⚠️ Missing images in some SFT dataset subsets — please report affected samples

#10 opened about 8 hours ago by

Yin-Xie

updated a dataset 2 days ago

lmms-lab/LLaVA-OneVision-1.5-Instruct-Data

Updated 2 days ago • 96.1k • 50

xiangan

updated a dataset 2 days ago

lmms-lab/LLaVA-NeXT-780k-webdataset

Updated 2 days ago • 1.59k

xiangan

updated a collection 2 days ago

LLaVA-OneVision-1.5

https://github.com/EvolvingLMMs-Lab/LLaVA-OneVision-1.5 • 9 items • Updated 2 days ago • 15

Yin-Xie

in lmms-lab/LLaVA-OneVision-1.5-Instruct-Data 2 days ago

Request to Unlock More Storage for lmms-lab/LLaVA-OneVision-1.5-Instruct-Data

#9 opened 2 days ago by

winking636

updated a dataset 3 days ago

lmms-lab/LLaVA-One-Vision-1.5-Mid-Training-85M

Preview • Updated 3 days ago • 106k • 39

russwang

authored a paper 3 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published 4 days ago • 179

russwang

authored a paper 10 days ago

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published 15 days ago • 40

luodian

authored 2 papers 13 days ago

MME-Survey: A Comprehensive Survey on Evaluation of Multimodal LLMs

Paper • 2411.15296 • Published Nov 22, 2024 • 21

Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos

Paper • 2501.13826 • Published Jan 23 • 25

xiangan

authored a paper 13 days ago

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published 15 days ago • 40

luodian

authored 2 papers 13 days ago

LLaVA-OneVision-1.5: Fully Open Framework for Democratized Multimodal Training

Paper • 2509.23661 • Published 15 days ago • 40

Visual Jigsaw Post-Training Improves MLLMs

Paper • 2509.25190 • Published 14 days ago • 35

yshenaw

authored 5 papers 14 days ago

Complete Dictionary Learning via $\ell_p$-norm Maximization

Paper • 2002.10043 • Published Feb 24, 2020

ImageBrush: Learning Visual In-Context Instructions for Exemplar-Based Image Manipulation

Paper • 2308.00906 • Published Aug 2, 2023 • 13

Sparse Mixture-of-Experts are Domain Generalizable Learners

Paper • 2206.04046 • Published Jun 8, 2022 • 1

CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation

Paper • 2404.19394 • Published Apr 30, 2024

ALPINE: Unveiling the Planning Capability of Autoregressive Learning in Language Models

Paper • 2405.09220 • Published May 15, 2024 • 28