File size: 1,136 Bytes
f561f78 f8c78a2 f561f78 f8c78a2 8541c9d f8c78a2 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 |
---
license: cc-by-sa-4.0
language:
- az
base_model: facebook/mms-tts
pipeline_tag: text-to-speech
library_name: transformers
---
<h1> Voice of SARA </h1>
Baku Higher Oil School Research and Development Center on AI introduce their new Text-to-Speech model in collaboration with PRODATA. Model is based on VITS architecture, referenced to Meta MMS on Azerbaijani.
![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/642bb2d07f152f6e72b8ad2a/tOmRzmjzbADs7UNHL09DE.jpeg)
(c) Image has been generated by using Microsoft AI Image Generator!
Meta MMS model has good performance in naturalness of the speech while it was not robust to the change in the input tokens. Intonation varied according to the input tokens.
Our team has built speech and text pairs from public sources and combined them with 2-3 sentences to create continuous speech in the input.
Thanks to Kavsar Huseynova, Elvin Mammadov, Qurban Quliyev and PRODATA for the the contributions to this project!
All rights are reserved!
Note: Team and collaborators are not responsible about the contents of the generated voices by different individuals! |