Spaces:

Vinay15
/

Fine-tuning_TTS_for_a_Regional_Language

Sleeping

App Files Files Community

Vinay15 commited on Oct 28

Commit

72e2358

•

1 Parent(s): d248bff

Update app.py

Browse files

Files changed (1) hide show

app.py +24 -6

app.py CHANGED Viewed

@@ -41,20 +41,38 @@ def synthesize_speech(text):
     return (16000, speech.cpu().numpy())
 # Title and description for the Gradio interface
-title = "Fine-tuning TTS for a Italian Language Using SpeechT5"
-description = """
-This Space generates speech in Italian using the fine-tuned SpeechT5 model from Hugging Face.
-The model is fine-tuned on the VoxPopuli Italian dataset.
 """
-# Create Gradio interface
 interface = gr.Interface(
     fn=synthesize_speech,
     inputs=gr.Textbox(label="Input Text", placeholder="Enter Italian text here..."),
     outputs=gr.Audio(label="Generated Speech"),
     title=title,
     description=description,
-    examples=["Questa è una dimostrazione di sintesi vocale in italiano."]
 )
 # Launch the interface

     return (16000, speech.cpu().numpy())
 # Title and description for the Gradio interface
+title = "Fine-tuning TTS for Italian as a Regional Language Using SpeechT5"
+description = f"""
+This Space generates speech in Italian, a regional language, using a fine-tuned SpeechT5 model from Hugging Face.
+Italian is considered a regional language because it is primarily spoken within Italy and a few Italian-speaking regions in
+other countries, such as Switzerland, San Marino, Vatican City, and areas in Croatia and Slovenia.
+With about 85 million speakers worldwide, Italian's regional usage contrasts with the global reach of languages like English or Spanish.
+**Fine-Tuned Model Preparation:** This model has been fine-tuned using the VoxPopuli Italian dataset to optimize SpeechT5 for
+Italian pronunciation, intonation, and fluency. The fine-tuning process involved preprocessing the text data to ensure accurate
+Italian accents and phonetics, resulting in high-quality Italian speech synthesis.
+The fine-tuned model is available [here](https://huggingface.co/Vinay15/speecht5_finetuned_voxpopuli_it).
+**Note:** Processing time may vary based on sentence length. Longer sentences may take more time to process and generate audio.
+For more details, visit the [GitHub repository](https://github.com/Vinay152003/Fine-tuning-TTS-for-a-Italian-it-Language) and review the project [report](https://drive.google.com/file/d/1cvNPkuFlTZAu1iDaagCwVRGXFd6r6vqi/view?usp=sharing).
 """
+# Create Gradio interface with multiple examples
 interface = gr.Interface(
     fn=synthesize_speech,
     inputs=gr.Textbox(label="Input Text", placeholder="Enter Italian text here..."),
     outputs=gr.Audio(label="Generated Speech"),
     title=title,
     description=description,
+    examples=[
+        ["Questa è una dimostrazione di sintesi vocale in italiano."],
+        ["Benvenuti alla nostra piattaforma di sintesi vocale!"],
+        ["Il modello è stato addestrato per parlare l'italiano in modo naturale e fluido."],
+        ["Oggi il tempo è bello e il sole splende."],
+        ["La città di Roma è una delle destinazioni turistiche più popolari al mondo."]
+    ]
 )
 # Launch the interface