Anuj Diwan
ajd12342
ParaSpeechCaps: Rich Style Prompted TTS
The ParaSpeechCaps dataset and models trained on it
Scaling Rich Style-Prompted Text-to-Speech Datasets
Paper • 2503.04713 • Published • 1
ajd12342/paraspeechcaps
Viewer • Updated • 1.07M • 714 • 19
ajd12342/parler-tts-mini-v1-paraspeechcaps
Text-to-Speech • 0.9B • Updated • 10 • 5
ajd12342/parler-tts-mini-v1-paraspeechcaps-only-base
Text-to-Speech • 0.9B • Updated • 1 • 1
ParaSpeechCLAP: Dual-Encoder Speech-Text Model
The ParaSpeechCLAP models and datasets used to train them.
models 5
ajd12342/paraspeechclap-combined
Audio Classification • Updated
ajd12342/paraspeechclap-situational
Audio Classification • Updated
ajd12342/paraspeechclap-intrinsic
Audio Classification • Updated
ajd12342/parler-tts-mini-v1-paraspeechcaps-only-base
Text-to-Speech • 0.9B • Updated • 1 • 1
ajd12342/parler-tts-mini-v1-paraspeechcaps
Text-to-Speech • 0.9B • Updated • 10 • 5
datasets 13
ajd12342/paraspeechcaps-situational-train
Viewer • Updated • 96.2k • 32
ajd12342/paraspeechcaps-intrinsic-train
Viewer • Updated • 945k • 36
ajd12342/paraspeechclap-eval-combined
Viewer • Updated • 1.43k • 31
ajd12342/paraspeechclap-eval-situational
Viewer • Updated • 1.43k • 28
ajd12342/paraspeechclap-eval-intrinsic
Viewer • Updated • 9.4k • 50
ajd12342/paraspeechcaps
Viewer • Updated • 1.07M • 714 • 19
ajd12342/psc-intrinsic-test-dataset-v3-zsc-accent
Viewer • Updated • 2.44k • 6
ajd12342/psc-intrinsic-test-dataset-v3-zsc-rhythm
Viewer • Updated • 1.96k • 10
ajd12342/psc-intrinsic-test-dataset-v3-zsc-volume
Viewer • Updated • 1.22k • 9
ajd12342/psc-intrinsic-test-dataset-v3-zsc-clarity
Viewer • Updated • 875 • 10