Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
diwank
's Collections
search
Vision
Art
K
S1.1
Sam
Audio
thought
Audio
updated
1 day ago
Upvote
-
espnet/yodas2
Updated
Jun 10
•
23.9k
•
26
Flux9665/BibleMMS
Viewer
•
Updated
Jun 16
•
736k
•
1.05k
•
65
google/MusicCaps
Viewer
•
Updated
Mar 8, 2023
•
5.52k
•
361
•
124
ShoukanLabs/AniSpeech
Viewer
•
Updated
Jan 29
•
23.7k
•
544
•
37
muzaik/captioned-audio-1k
Viewer
•
Updated
May 28
•
1.05k
•
108
•
4
aoxo/text2asmr-uncensored
Preview
•
Updated
Feb 19
•
227
•
5
google/fleurs
Updated
Aug 25
•
23.3k
•
254
phongdtd/youtube_casual_audio
Updated
Sep 10
•
76
•
4
ProgramComputer/voxceleb
Updated
Jul 27
•
2.04k
•
57
jhu-clsp/seamless-align
Preview
•
Updated
Jun 2
•
180
•
10
IVLLab/MultiDialog
Updated
Aug 29
•
588
•
12
PetraAI/PetraAI
Updated
Sep 14, 2023
•
399
•
20
ReDUB/SoundHarvest
Viewer
•
Updated
Dec 14, 2023
•
2
•
68
•
2
jhu-clsp/seamless-align-expressive
Updated
Feb 22
•
55
•
4
jg583/NSynth
Updated
Apr 26
•
217
•
17
voice-is-cool/voxtube
Viewer
•
Updated
Feb 13
•
4.46M
•
729
•
11
google/speech_commands
Updated
Jan 18
•
1.38k
•
32
Fhrozen/FSD50k
Preview
•
Updated
May 27, 2022
•
1.39k
•
4
nvidia/parakeet-tdt-1.1b
Automatic Speech Recognition
•
Updated
Apr 30
•
489k
•
80
yl4579/StyleTTS2-LibriTTS
Updated
Nov 21, 2023
•
42
coqui/XTTS-v2
Text-to-Speech
•
Updated
Dec 11, 2023
•
1.57M
•
1.98k
facebook/wav2vec2-large-robust
Updated
Nov 5, 2021
•
1.12k
•
31
laion/links_to_pocasts_lecture_and_shows_for_tts
Viewer
•
Updated
May 29
•
331k
•
9
•
8
laion/youtube-urls-for-emotional-tts
Viewer
•
Updated
May 21
•
78.3k
•
49
•
3
laion/chirp-v2-dataset
Viewer
•
Updated
Mar 25
•
64
•
757
•
5
speechcolab/gigaspeech
Viewer
•
Updated
Nov 23, 2023
•
364k
•
10.4k
•
91
fixie-ai/boolq-audio
Viewer
•
Updated
Jun 12
•
12.7k
•
199
•
7
fixie-ai/soda-audio
Viewer
•
Updated
Jul 24
•
102k
•
98
•
3
amphion/Emilia
Preview
•
Updated
Sep 6
•
77
•
81
google/cvss
Updated
Feb 10
•
189
•
12
PolyAI/minds14
Updated
Sep 10
•
4.61k
•
77
Qwen/Qwen2-Audio-7B-Instruct
Audio-Text-to-Text
•
Updated
1 day ago
•
30.1k
•
235
infgrad/dialogue_rewrite_llm
Viewer
•
Updated
Feb 17
•
1.64M
•
66
•
13
FBK-MT/Speech-MASSIVE
Viewer
•
Updated
Aug 8
•
97.6k
•
1.86k
•
31
Qwen/Qwen2-Audio-7B
Audio-Text-to-Text
•
Updated
1 day ago
•
4.73k
•
66
Mozilla/whisperfile
Updated
Oct 2
•
1.14k
•
237
vucinatim/spectrogram-captions
Viewer
•
Updated
Jan 3, 2023
•
1k
•
57
•
2
rachit8562/mel_spectogram_bird_audio
Viewer
•
Updated
Jan 7, 2023
•
72.2k
•
64
•
2
novateur/WavTokenizer
Text-to-Speech
•
Updated
Sep 27
•
44
gpt-omni/mini-omni
Text-to-Speech
•
Updated
Sep 4
•
1
•
399
amphion/Emilia-Dataset
Viewer
•
Updated
Sep 6
•
52.9M
•
57.5k
•
157
FLUX that Plays Music
Paper
•
2409.00587
•
Published
Sep 1
•
31
feizhengcong/FluxMusic
Updated
Aug 31
•
63
fishaudio/fish-speech-1.4
Text-to-Speech
•
Updated
17 days ago
•
16.3k
•
427
ICTNLP/Llama-3.1-8B-Omni
Updated
8 days ago
•
4.35k
•
377
HuggingFaceFV/finevideo
Viewer
•
Updated
17 days ago
•
39.5k
•
13.7k
•
269
kyutai/moshiko-pytorch-bf16
Updated
Sep 18
•
87.2k
•
150
kyutai/moshika-pytorch-bf16
Updated
Sep 18
•
1.54k
•
45
Revai/reverb-asr
Automatic Speech Recognition
•
Updated
Oct 8
•
63
•
73
FBK-MT/mosel
Viewer
•
Updated
23 days ago
•
57.5M
•
1.16k
•
64
homebrewltd/llama3-s-instruct-v0.2
Updated
Aug 23
•
796
•
42
SWivid/F5-TTS
Text-to-Speech
•
Updated
13 days ago
•
563k
•
736
mit-han-lab/hart-0.7b-1024px
Unconditional Image Generation
•
Updated
4 days ago
•
7
THUDM/glm-4-voice-9b
Updated
28 days ago
•
8.17k
•
70
amphion/MaskGCT
Text-to-Speech
•
Updated
28 days ago
•
238
nvidia/parakeet-tdt_ctc-110m
Automatic Speech Recognition
•
Updated
about 1 month ago
•
94.1k
•
13
nvidia/audio-flamingo
Updated
Oct 2
•
16
fishaudio/fish-agent-v0.1-3b
Audio-to-Audio
•
Updated
20 days ago
•
966
•
210
OuteAI/OuteTTS-0.1-350M
Text-to-Speech
•
Updated
15 days ago
•
8.09k
•
283
adamo1139/Meta_Spirit-LM-ungated
Text-to-Audio
•
Updated
Oct 20
•
16
si-pbc/hertz-dev
Audio-to-Audio
•
Updated
7 days ago
•
194
pyannote/speech-separation-ami-1.0
Updated
10 days ago
•
65.7k
•
40
nyuuzyou/suno
Preview
•
Updated
1 day ago
•
61
•
25
gpt-omni/mini-omni2
Any-to-Any
•
Updated
28 days ago
•
1.91k
•
186
fixie-ai/ultravox-v0_4_1-llama-3_1-70b
Feature Extraction
•
Updated
3 days ago
•
425
•
21
aiola/whisper-ner-tag-and-mask-v1
Automatic Speech Recognition
•
Updated
about 4 hours ago
•
33
•
2
Upvote
-
Share collection
View history
Collection guide
Browse collections