microsoft/VibeVoice-1.5B
Text-to-Speech β’ 3B β’ Updated β’ 112k β’ 2.38k
Generate multiβspeaker AI podcasts from a script
Generate realistic audio for your video using text prompts
Generate custom images with style and subject references
exapand images with Qwen Image Edit
Launch VibeVoice demo for text-to-speech using CPU
Relight images with Qwen Image Edit
Nano Banana for Hugging Face PRO users
Generate video from an image and audio file
Official Google Nano Banana + WAN 2.2 FAST Video