Spaces: amphion/naturalspeech3_facodec
Hecheng0625 committed on Mar 11

Commit 4dfbe06 • 1 Parent(s): 97320e7

Update app.py

Files changed (1)

app.py CHANGED

@@ -76,6 +76,22 @@ demo = gr.Interface(
     inputs=demo_inputs,
     outputs=demo_outputs,
     title="NaturalSpeech3 FACodec",
+    description=
+    """
+    ## FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
+    [![arXiv](https://img.shields.io/badge/arXiv-Paper-<COLOR>.svg)](https://arxiv.org/pdf/2403.03100.pdf)
+    [![demo](https://img.shields.io/badge/FACodec-Demo-red)](https://speechresearch.github.io/naturalspeech3/)
+    [![model](https://img.shields.io/badge/%F0%9F%A4%97%20HuggingFace-Models-pink)](https://huggingface.co/amphion/naturalspeech3_facodec)
+    ## Overview
+    FACodec is a core component of the advanced text-to-speech (TTS) model NaturalSpeech 3. FACodec converts complex speech waveforms into disentangled subspaces representing the speech attributes of content, prosody, timbre, and acoustic details, and reconstructs high-quality speech waveforms from these attributes. This factorization simplifies the modeling of speech representations.
+    Researchers can use FACodec to develop different kinds of TTS models, such as non-autoregressive discrete diffusion models (NaturalSpeech 3) or autoregressive models (like VALL-E).
+    """,
 )
 if __name__ == "__main__":
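
The change above passes a Markdown string to the `description` argument of `gr.Interface`. One thing to watch with an indented triple-quoted string is that every line keeps its leading spaces, and some Markdown renderers treat four-space indentation as a code block. The sketch below is one possible way to sidestep that by dedenting the text first; it is not taken from the commit. `DESCRIPTION` is a shortened stand-in for the real text, and `build_demo` is a hypothetical helper, not a function in `app.py`.

```python
import textwrap

# Shortened stand-in for the Space's description (hypothetical, for illustration).
# textwrap.dedent removes the common leading indentation, so headings and badge
# links start at column 0 regardless of how the renderer treats indented lines.
DESCRIPTION = textwrap.dedent(
    """\
    ## FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
    [![demo](https://img.shields.io/badge/FACodec-Demo-red)](https://speechresearch.github.io/naturalspeech3/)
    ## Overview
    FACodec is a core component of the text-to-speech (TTS) model NaturalSpeech 3.
    """
).strip()


def build_demo(fn, inputs, outputs):
    """Hypothetical helper that builds the interface with the dedented description.

    gradio is imported lazily so the string handling above can be checked
    without the third-party package installed.
    """
    import gradio as gr  # third-party: pip install gradio

    return gr.Interface(
        fn=fn,
        inputs=inputs,
        outputs=outputs,
        title="NaturalSpeech3 FACodec",
        description=DESCRIPTION,  # shown above the input and output components
    )
```

Whether the dedent step is needed depends on how the installed Gradio version processes the description, so it is best treated as a defensive measure and not as a required fix.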