SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published 5 days ago • 35
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published 5 days ago • 35
CaptionEmporium/flickr-megalith-10m-internvl2-multi-caption Viewer • Updated Aug 28 • 8.51M • 258 • 9
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer Paper • 2401.10208 • Published Jan 18 • 1
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Paper • 2306.05423 • Published Jun 8, 2023