Small Language Models
microsoft/Phi-4-multimodal-instruct • Automatic Speech Recognition • 6B • Updated May 1 • 867k • 1.51k

TTS
SparkAudio/Spark-TTS-0.5B • Text-to-Speech • Updated Mar 7 • 1.06k • 699
sesame/csm-1b • Text-to-Speech • Updated Jul 23 • 27.7k • 2.23k
hexgrad/Kokoro-82M • Text-to-Speech • Updated Apr 10 • 3.49M • 5.13k
speaches-ai/Kokoro-82M-v1.0-ONNX-fp16 • Text-to-Speech • Updated Mar 21 • 2