MangaNinja: Line Art Colorization with Precise Reference Following • Paper • 2501.08332 • Published 2 days ago • 45
Visual Document Retrieval • Collection • A collection of models, datasets, and spaces in the VDR series • 5 items • Updated 6 days ago • 8
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs • Paper • 2501.06186 • Published 6 days ago • 54
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos • Paper • 2501.04001 • Published 9 days ago • 40
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation • Paper • 2412.07589 • Published Dec 10, 2024 • 45
[MASK] is All You Need • Collection • Code, dataset, and pretrained model • 5 items • Updated Nov 29, 2024 • 9