Step-Audio-R1 Collection Step-Audio-R1 is the first audio language model to successfully unlock test-time compute scaling. • 3 items • Updated 14 days ago • 15
wan2.1 controlnets Collection See code on github: https://github.com/TheDenk/wan2.1-dilated-controlnet • 6 items • Updated Oct 7 • 6
Zero-Shot Voice Cloning Collection TTS models that support zero-shot voice cloning • 8 items • Updated 3 days ago • 14
Flux tools in NF4 Collection Contains Flux Fill, Canny, and Dev checkpoints in NF4. • 3 items • Updated Nov 24, 2024 • 10