Collections
Discover the best community collections!
Collections including paper arxiv:2312.05431
-
aMUSEd: An Open MUSE Reproduction
Paper • 2401.01808 • Published • 28 -
From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
Paper • 2401.01885 • Published • 27 -
SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
Paper • 2401.00604 • Published • 4 -
LARP: Language-Agent Role Play for Open-World Games
Paper • 2312.17653 • Published • 29
-
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper • 2311.10093 • Published • 56 -
VideoBooth: Diffusion-based Video Generation with Image Prompts
Paper • 2312.00777 • Published • 20 -
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models
Paper • 2312.00079 • Published • 14 -
Efficient Quantization Strategies for Latent Diffusion Models
Paper • 2312.05431 • Published • 11
-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 75 -
Natural Language Supervision for General-Purpose Audio Representations
Paper • 2309.05767 • Published • 9 -
Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
Paper • 2309.08532 • Published • 52 -
AudioSR: Versatile Audio Super-resolution at Scale
Paper • 2309.07314 • Published • 24