Running on Zero 10 Magpietts Demo π 10 Generate multilingual speech from text with selectable voices
nvidia/nemotron-speech-streaming-en-0.6b Automatic Speech Recognition β’ Updated Jan 30 β’ 13.3k β’ 484
nvidia/Frame_VAD_Multilingual_MarbleNet_v2.0 Voice Activity Detection β’ Updated May 14, 2025 β’ 5.91k β’ 37
Running on Zero Featured 464 Parakeet-TDT-0.6b-V2 Β 464 Transcribe audio files into timestamped text and subtitles
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment Paper β’ 2405.01481 β’ Published May 2, 2024 β’ 30
RULER: What's the Real Context Size of Your Long-Context Language Models? Paper β’ 2404.06654 β’ Published Apr 9, 2024 β’ 39
CantTalkAboutThis: Aligning Language Models to Stay on Topic in Dialogues Paper β’ 2404.03820 β’ Published Apr 4, 2024 β’ 25