HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units • Paper • 2106.07447 • Published Jun 14, 2021 • 4
RMVPE: A Robust Model for Vocal Pitch Estimation in Polyphonic Music • Paper • 2306.15412 • Published Jun 27, 2023 • 1
Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech • Paper • 2106.06103 • Published Jun 11, 2021 • 4
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context • Paper • 2403.05530 • Published Mar 8, 2024 • 65
JackismyShephard/whisper-tiny-finetuned-minds14 • Automatic Speech Recognition • 37.8M • Updated Feb 12, 2024 • 1 • 3