Customizing Text-to-Image Models with a Single Image Pair Paper • 2405.01536 • Published May 2 • 18
Concept Weaver: Enabling Multi-Concept Fusion in Text-to-Image Models Paper • 2404.03913 • Published Apr 5
LCM-Lookahead for Encoder-based Text-to-Image Personalization Paper • 2404.03620 • Published Apr 4 • 1
Customizing Text-to-Image Diffusion with Camera Viewpoint Control Paper • 2404.12333 • Published Apr 18 • 1
jtatman/stable-diffusion-prompts-stats-full-uncensored Viewer • Updated 17 days ago • 897k • 240 • 50
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders Paper • 2408.15998 • Published Aug 28 • 83
SEA: Supervised Embedding Alignment for Token-Level Visual-Textual Integration in MLLMs Paper • 2408.11813 • Published Aug 21 • 11
TokenPacker: Efficient Visual Projector for Multimodal LLM Paper • 2407.02392 • Published Jul 2 • 21
PALP: Prompt Aligned Personalization of Text-to-Image Models Paper • 2401.06105 • Published Jan 11 • 47
CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications Paper • 2408.03703 • Published Aug 7