This collection contains datasets which contain audio with non-verbal tags such as <laugh>, <sigh> being transcribed.
Christopher Özbek
oezi13
AI & ML interests
Text-To-Speech
Recent Activity
published
a model
about 2 months ago
oezi13/PlayDiffusion-nonverbal
updated
a model
about 2 months ago
oezi13/PlayDiffusion-nonverbal
new activity
about 2 months ago
nvidia/canary-1b-v2:Timestamp accuracy benchmarks?