9 4 23

ZhenYE

ZhenYe234

https://github.com/zhenye234

zhenye234

AI & ML interests

None yet

Recent Activity

liked a model 4 days ago

SebastianBodza/Kartoffel-1B-v0.3

upvoted a paper 5 days ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

updated a model 6 days ago

HKUSTAudio/Llasa-1B

View all activity

Organizations

ZhenYe234's activity

liked a model 4 days ago

SebastianBodza/Kartoffel-1B-v0.3

Updated 5 days ago • 512 • 4

upvoted a paper 5 days ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published 8 days ago • 60

updated 3 models 6 days ago

liked a model 9 days ago

baichuan-inc/Baichuan-Audio-Instruct

Updated 18 days ago • 158 • 10

updated a model 9 days ago

HKUSTAudio/Llasa-1B-Multilingual

Text-to-Speech • Updated 9 days ago • 2.12k • 26

liked a model 9 days ago

ASLP-lab/LLaSE-G1

Audio-to-Audio • Updated about 12 hours ago • 14

New activity in HKUSTAudio/xcodec2 12 days ago

Non-deterministic behaviour with batch size > 1

#9 opened 14 days ago by

prabhatp251

updated a model 19 days ago

HKUSTAudio/xcodec2

Audio-to-Audio • Updated 19 days ago • 18.2k • 57

updated a model 22 days ago

ZhenYe234/hubert_base_general_audio

Updated 22 days ago • 221k • 1

liked a model 22 days ago

ZhenYe234/hubert_base_general_audio

Updated 22 days ago • 221k • 1

liked 4 models 29 days ago

HKUSTAudio/Llasa-1B-multi-speakers-genshin-zh-en-ja-ko

Text-to-Speech • Updated 29 days ago • 145 • 2

HKUSTAudio/Llasa-3B-Preserve-TextChat

Text-to-Speech • Updated 29 days ago • 34 • 2

HKUSTAudio/Llasa-1B-Preserve-TextChat

Text-to-Speech • Updated 29 days ago • 33 • 2

HKUSTAudio/Llasa-1B-two-speakers-kore-puck

Text-to-Speech • Updated 29 days ago • 285 • 4

liked 2 Spaces 29 days ago

Llasa 1B Finetuned For Two Speakers

🔥

Using dataset shb777/gemini-flash-2.0-speech for finetuning

Llasa 1B Multi Speakers Genshin Zh En Ja Ko

🚀

Llasa-1B-Multilingual finetuned using simon3000/genshin-voic

updated a collection 29 days ago

Llasa

Collection

TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 11 items • Updated 22 days ago • 15