Markus PRO
AI & ML interests
NLP
Recent Activity
liked a model, 1 day ago: marksverdhei/Qwen3-Voice-Embedding-12Hz-1.7B
liked a model, 1 day ago: marksverdhei/Qwen3-Voice-Embedding-12Hz-0.6B
upvoted a collection, 1 day ago: Qwen3 Voice Embedding
FP8 quants
A collection of my FP8 quants for models missing this.
SYAC
Old models from my thesis: https://www.duo.uio.no/handle/10852/96578
Currently using
- unsloth/GLM-4.7-Flash-GGUF
  Text Generation • 30B • Updated • 358k • 438
- Qwen/Qwen3-TTS-12Hz-0.6B-Base
  Text-to-Speech • Updated • 179k • 166
- openai/whisper-large-v3-turbo
  Automatic Speech Recognition • 0.8B • Updated • 3.1M • 2.81k
- Hexoplon/nb-whisper-large-distil-turbo-beta-ct2
  Automatic Speech Recognition • Updated • 13 • 1
Vintage AI
Old but gold AI classics
Qwen3 Voice Embedding
Standalone ECAPA-TDNN x-vector speaker encoders extracted from Qwen3-TTS. 1024-dim (0.6B) and 2048-dim (1.7B).