Audio - a diwank Collection

Audio

updated 1 day ago

espnet/yodas2

Updated Jun 10 • 23.9k • 26
Flux9665/BibleMMS

Viewer • Updated Jun 16 • 736k • 1.05k • 65
google/MusicCaps

Viewer • Updated Mar 8, 2023 • 5.52k • 361 • 124
ShoukanLabs/AniSpeech

Viewer • Updated Jan 29 • 23.7k • 544 • 37
muzaik/captioned-audio-1k

Viewer • Updated May 28 • 1.05k • 108 • 4
aoxo/text2asmr-uncensored

Preview • Updated Feb 19 • 227 • 5
google/fleurs

Updated Aug 25 • 23.3k • 254
phongdtd/youtube_casual_audio

Updated Sep 10 • 76 • 4
ProgramComputer/voxceleb

Updated Jul 27 • 2.04k • 57
jhu-clsp/seamless-align

Preview • Updated Jun 2 • 180 • 10
IVLLab/MultiDialog

Updated Aug 29 • 588 • 12
PetraAI/PetraAI

Updated Sep 14, 2023 • 399 • 20
ReDUB/SoundHarvest

Viewer • Updated Dec 14, 2023 • 2 • 68 • 2
jhu-clsp/seamless-align-expressive

Updated Feb 22 • 55 • 4
jg583/NSynth

Updated Apr 26 • 217 • 17
voice-is-cool/voxtube

Viewer • Updated Feb 13 • 4.46M • 729 • 11
google/speech_commands

Updated Jan 18 • 1.38k • 32
Fhrozen/FSD50k

Preview • Updated May 27, 2022 • 1.39k • 4
nvidia/parakeet-tdt-1.1b

Automatic Speech Recognition • Updated Apr 30 • 489k • 80
yl4579/StyleTTS2-LibriTTS

Updated Nov 21, 2023 • 42
coqui/XTTS-v2

Text-to-Speech • Updated Dec 11, 2023 • 1.57M • 1.98k
facebook/wav2vec2-large-robust

Updated Nov 5, 2021 • 1.12k • 31
laion/links_to_pocasts_lecture_and_shows_for_tts

Viewer • Updated May 29 • 331k • 9 • 8
laion/youtube-urls-for-emotional-tts

Viewer • Updated May 21 • 78.3k • 49 • 3
laion/chirp-v2-dataset

Viewer • Updated Mar 25 • 64 • 757 • 5
speechcolab/gigaspeech

Viewer • Updated Nov 23, 2023 • 364k • 10.4k • 91
fixie-ai/boolq-audio

Viewer • Updated Jun 12 • 12.7k • 199 • 7
fixie-ai/soda-audio

Viewer • Updated Jul 24 • 102k • 98 • 3
amphion/Emilia

Preview • Updated Sep 6 • 77 • 81
google/cvss

Updated Feb 10 • 189 • 12
PolyAI/minds14

Updated Sep 10 • 4.61k • 77
Qwen/Qwen2-Audio-7B-Instruct

Audio-Text-to-Text • Updated 1 day ago • 30.1k • 235
infgrad/dialogue_rewrite_llm

Viewer • Updated Feb 17 • 1.64M • 66 • 13
FBK-MT/Speech-MASSIVE

Viewer • Updated Aug 8 • 97.6k • 1.86k • 31
Qwen/Qwen2-Audio-7B

Audio-Text-to-Text • Updated 1 day ago • 4.73k • 66
Mozilla/whisperfile

Updated Oct 2 • 1.14k • 237
vucinatim/spectrogram-captions

Viewer • Updated Jan 3, 2023 • 1k • 57 • 2
rachit8562/mel_spectogram_bird_audio

Viewer • Updated Jan 7, 2023 • 72.2k • 64 • 2
novateur/WavTokenizer

Text-to-Speech • Updated Sep 27 • 44
gpt-omni/mini-omni

Text-to-Speech • Updated Sep 4 • 1 • 399
amphion/Emilia-Dataset

Viewer • Updated Sep 6 • 52.9M • 57.5k • 157
FLUX that Plays Music

Paper • 2409.00587 • Published Sep 1 • 31
feizhengcong/FluxMusic

Updated Aug 31 • 63
fishaudio/fish-speech-1.4

Text-to-Speech • Updated 17 days ago • 16.3k • 427
ICTNLP/Llama-3.1-8B-Omni

Updated 8 days ago • 4.35k • 377
HuggingFaceFV/finevideo

Viewer • Updated 17 days ago • 39.5k • 13.7k • 269
kyutai/moshiko-pytorch-bf16

Updated Sep 18 • 87.2k • 150
kyutai/moshika-pytorch-bf16

Updated Sep 18 • 1.54k • 45
Revai/reverb-asr

Automatic Speech Recognition • Updated Oct 8 • 63 • 73
FBK-MT/mosel

Viewer • Updated 23 days ago • 57.5M • 1.16k • 64
homebrewltd/llama3-s-instruct-v0.2

Updated Aug 23 • 796 • 42
SWivid/F5-TTS

Text-to-Speech • Updated 13 days ago • 563k • 736
mit-han-lab/hart-0.7b-1024px

Unconditional Image Generation • Updated 4 days ago • 7
THUDM/glm-4-voice-9b

Updated 28 days ago • 8.17k • 70
amphion/MaskGCT

Text-to-Speech • Updated 28 days ago • 238
nvidia/parakeet-tdt_ctc-110m

Automatic Speech Recognition • Updated about 1 month ago • 94.1k • 13
nvidia/audio-flamingo

Updated Oct 2 • 16
fishaudio/fish-agent-v0.1-3b

Audio-to-Audio • Updated 20 days ago • 966 • 210
OuteAI/OuteTTS-0.1-350M

Text-to-Speech • Updated 15 days ago • 8.09k • 283
adamo1139/Meta_Spirit-LM-ungated

Text-to-Audio • Updated Oct 20 • 16
si-pbc/hertz-dev

Audio-to-Audio • Updated 7 days ago • 194
pyannote/speech-separation-ami-1.0

Updated 10 days ago • 65.7k • 40
nyuuzyou/suno

Preview • Updated 1 day ago • 61 • 25
gpt-omni/mini-omni2

Any-to-Any • Updated 28 days ago • 1.91k • 186
fixie-ai/ultravox-v0_4_1-llama-3_1-70b

Feature Extraction • Updated 3 days ago • 425 • 21
aiola/whisper-ner-tag-and-mask-v1

Automatic Speech Recognition • Updated about 4 hours ago • 33 • 2
