Cannot load the model
Hello
Two quick comments:
- On the sample there are these imports:
import tempfile
import gradio as gr
import numpy as np
import os
import json
I do not think that they are needed. There is no need to install all these dependencies (e..g gradio).
- When trying to load the model, following the receipe in the README.md, Coqui gives the error bellow.
When I try to look at the contents of the config.json I cannot see a plain text file, instead "archive/data.pklFBZZZZZZZZZZZZZ"
May be I should load the model following a specifc instructions? Or may the model is published in an incorrect format? I do not know.
Code used:
def syntetize():
from typing import Optional
from TTS.config import load_config
from TTS.utils.manage import ModelManager
from TTS.utils.synthesizer import Synthesizer
model_path = "aina/model/best_model.pth"
config_path = "aina/model/config.json"
speakers_file_path = "aina/model/speakers.pth"
text = "Hola amics!"
speaker_idx = "1"
synthesizer = Synthesizer(
model_path, config_path, speakers_file_path, None, None, None,
)
wavs = synthesizer.tts(text, speaker_idx)
Thanks
Jordi
File "/home/jordi/sc/aina-python/lib/python3.11/site-packages/TTS/config/init.py", line 89, in load_config
data = json.load(f)
^^^^^^^^^^^^
File "/usr/lib/python3.11/json/init.py", line 293, in load
return loads(fp.read(),
^^^^^^^^^
File "", line 322, in decode
UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 64: invalid start byte
Hi Jordi,
We have stopped maintaining this model, sorry it is still available due to administrative reasons which causes confusion.
Was there a specific reason that you wanted to use the Coqui models? Our most advanced models can be tried here and the instructions to deploy them are in the About (Informació) tab. Let us know what would be the most appropriate way to help you. Thanks.
OK. The reason to use Coqui is because the project already uses Coqui for the other languages.
But it's undertood. Thanks
If Coqui is required for your setup, you can check this config file: https://huggingface.co/spaces/projecte-aina/tts-ca-coqui-vits-multispeaker/blob/main/config_multisteplr.json
However, we think that matxa models have better quality and perfomance than the ones trained with coqui.
Thanks! The link gives a 404.
Is the Coqui model that you are sharing newer that the Catalan model provided by https://github.com/idiap/coqui-ai-TTS?
I'm trying to understand if there is value in using this model instead of the currently provided by Coqui build with you also :)
Thanks
I'm sorry, it was pointing to a private space. You can find it here: https://huggingface.co/projecte-aina/tts-ca-coqui-vits-multispeaker/blob/main/model/config_multisteplr.json This version was trained with more data and is posterior to the one uploaded in coqui.
Thanks. I saw the fixes in the repo. I will try this in the next days and I will let you know if there is a problem.
In order for your work to have maxium reach, I suggest you to consider commiting the new model to:
https://github.com/idiap/coqui-ai-TTS
It seems not too complicated:
https://github.com/idiap/coqui-ai-TTS/commit/cf765cb3f2c4b29f8c91eb5eda52ba2203a09eb3
Thanks again
Jordi