Spaces: jlopez00/tts-service

jlopez00 committed on Dec 2, 2024

Commit 2c01ee6 (verified) · 1 Parent(s): b3385db

Upload folder using huggingface_hub

Files changed (15)

.vscode/launch.json +1 -1
README.md +1 -1
logging.yml +8 -0
notebooks/sample.txt +7 -34
poetry.lock +12 -1
pyproject.toml +2 -0
requirements.txt +3 -0
rvc/infer/infer.py +6 -0
rvc/lib/tools/prerequisites_download.py +20 -4
tabs/workflow/workflow.py +14 -1
tts_service/app.py +6 -38
tts_service/cli.py +9 -31
tts_service/start.py +30 -0
tts_service/tts.py +4 -1
tts_service/voices.py +6 -1

.vscode/launch.json CHANGED

@@ -17,7 +17,7 @@
             "name": "App",
             "type": "debugpy",
             "request": "launch",
-            "program": "tts_service/app.py",
             "args": ["--open"],
             "console": "integratedTerminal",
             "envFile": "${workspaceFolder}/.env",

             "name": "App",
             "type": "debugpy",
             "request": "launch",
+            "program": "tts_service/start.py",
             "args": ["--open"],
             "console": "integratedTerminal",
             "envFile": "${workspaceFolder}/.env",

README.md CHANGED

@@ -1,6 +1,6 @@
 ---
 title: tts-service
-app_file: tts_service/app.py
 sdk: gradio
 sdk_version: 4.43.0
 ---

 ---
 title: tts-service
+app_file: tts_service/start.py
 sdk: gradio
 sdk_version: 4.43.0
 ---

logging.yml CHANGED

@@ -9,6 +9,14 @@ handlers:
     formatter: simple
     stream: ext://sys.stdout
 loggers:
   tts_service:
     level: INFO
     handlers: [console]

     formatter: simple
     stream: ext://sys.stdout
 loggers:
+  rvc:
+    level: INFO
+    handlers: [console]
+    propagate: no
+  tabs:
+    level: INFO
+    handlers: [console]
+    propagate: no
   tts_service:
     level: INFO
     handlers: [console]
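
Note on the `logging.yml` hunk: the new `rvc` and `tabs` entries attach the console handler directly and set `propagate: no`, so records from those packages should be emitted once by that handler rather than also travelling up to the root logger. A minimal sketch of the equivalent dictionary passed to `dictConfig` follows. The `simple` format string and the `version` key are assumptions, since the hunk does not show them, and the `tts_service` entry only carries the keys visible in the diff.

```python
import logging
from logging.config import dictConfig

# Dict form of the YAML fragment. The format string is assumed, not taken from the repo.
LOGGING = {
    "version": 1,
    "disable_existing_loggers": False,
    "formatters": {"simple": {"format": "%(name)s %(levelname)s %(message)s"}},
    "handlers": {
        "console": {
            "class": "logging.StreamHandler",
            "formatter": "simple",
            "stream": "ext://sys.stdout",
        }
    },
    "loggers": {
        # New in this commit: own handler, no propagation to the root logger.
        "rvc": {"level": "INFO", "handlers": ["console"], "propagate": False},
        "tabs": {"level": "INFO", "handlers": ["console"], "propagate": False},
        # Pre-existing entry; only the keys shown in the hunk are reproduced.
        "tts_service": {"level": "INFO", "handlers": ["console"]},
    },
}

dictConfig(LOGGING)

# A module-level logger such as logging.getLogger(__name__) inside rvc/infer/infer.py
# is a child of "rvc" and inherits its level and handler.
child = logging.getLogger("rvc.infer.infer")
child.info("Converting audio...")
```

Because the modules in this commit create their loggers with `logging.getLogger(__name__)`, configuring the package-level names `rvc` and `tabs` is enough to cover every submodule.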

notebooks/sample.txt CHANGED

@@ -1,34 +1,7 @@
-Thanksgiving in America 2024: Social devastation for the working class, billions more for the oligarchs
-By Barry Grey
-On Thanksgiving Day 2024, the broad mass of the American population has precious little to be thankful for. Inflated costs for all necessities---housing, food, healthcare, childcare, transport---continue to weigh on working class families. Workers are struggling to scrape together turkey dinners for their families and friends under conditions where the cost of a Thanksgiving meal is up 19 percent from pre-pandemic 2019, according to the American Farm Bureau Federation.
-But there is an entirely different reality in the environs of the rich and the super-rich. Champagne glasses are clinking on Wall Street, at Donald Trump's Mar-a-Lago estate, at the White House and on Capitol Hill as the stock market booms and the bonanza reaped by America's oligarchs and their political servants under Joe Biden is set to be massively amplified under the incoming Trump administration.
-Outgoing President Biden is spending his holiday at the Nantucket home of David Rubenstein, billionaire co-founder of the Carlyle Group. Defeated presidential candidate Kamala Harris is sunning herself in Hawaii. Most of their voters, and most of Trump's, are struggling simply to get by.
-The ranks of America's billionaires grew to 800 under Biden and their collective wealth increased by 62 percent to more than $6.2 trillion (not counting the additional hundreds of billions amassed in the stock market surge since the election of Trump). As of December of last year, the top 1 percent of Americans took 21 percent of all personal incomes, more than double the share of the bottom 50 percent. The top one percent of Americans owned 35 percent of all personal wealth and the top 10 percent owned 71 percent, while the bottom 50 percent owned just 1 percent.
-In less than eight weeks, Trump's cabal of billionaires, fascists and quacks is set to take office. It will seek to impose an agenda of mass deportations and state repression, escalation of war and genocide, further tax cuts for the rich, the ripping up of what remains of the social safety net, the dismantling of public health and public education, and the lifting of virtually all regulations on big business. Things could not appear rosier for the financial parasites who control both parties and the entire political system.
-Elon Musk, Trump's crony and the richest man in the world, with a fortune of $300 billion and counting, will be joined by fellow billionaire Vivek Ramaswamy at the head of Trump's new Department of Government Efficiency, where they plan to cut the federal budget by $2 trillion. This will mean the gutting of Medicare, Medicaid and Social Security and the firing of hundreds of thousands of government workers.
-This is on top of an already unfolding social disaster for the masses. Almost 40 percent of Americans, in a survey by the Harris Poll for Bloomberg News in December of 2023, said their household recently relied on extra money besides their regular income **** to make ends meet. Of those, 38 percent said the additional amounts barely covered their monthly expenses with nothing left over, and 23 per cent said it wasn't enough to pay their bills.
-Since last December, mass layoffs in auto, aerospace, retail and other sectors have continued to spread, further devastating working class families. These conditions have already sparked a wave of strikes---at Boeing, on the docks, in the auto sector---in which workers rebelled against the union bureaucrats and voted down sellout contracts.
-Here are some key indices of the social reality confronting broad layers of the American people on Thanksgiving Day:
-Hunger
-According to the most recent Census Bureau Household Pulse Survey (October 2023), one out of every eight American adults is struggling to afford enough food. Nearly 28 million adults nationwide---12.5 percent of the adult population---are living in homes where there is either sometimes or often not enough to eat. This is the highest that figure has reached since the first year of the COVID-19 pandemic.
-Homelessness
-The "State of Homelessness" document of the National Alliance to End Homelessness, 2024 edition, reports:
-In 2023, the year-over-year increase in the number of people experiencing homelessness was 12.1 percent, the biggest increase since data collection began in 2007.
-From 2022 to 2023, homelessness for entire families increased by 15.5 percent.
-More people than ever are experiencing homelessness for the first time. From 2019 to 2023, the number of people who entered emergency shelter for the first time increased more than 23 percent. Over the course of 2023, nearly one million people experienced homelessness for the first time.
-Severe housing cost burdens are on the rise. The number of renter households paying more than 50 percent of their income on rent increased dramatically, rising over 12.6 percent between 2015 and 2022.
-More than half of people experiencing sheltered homelessness, and slightly less than half of people experiencing unsheltered homelessness are formally employed.
-Poverty
-According to the Organization for Economic Cooperation and Development (OECD), the United States has the highest poverty rate among the world's 26 most developed countries. The United Nations Children's Fund (UNICEF) ranks the United States second behind Mexico on a scale of what economists call "relative child poverty" when measured against 35 of the world's richest nations.
-In 2023, the official US poverty rate, according to the United States Census Bureau, was 11.1 percent. There were 36.8 million people in poverty in 2023.
-Life expectancy
-Overall life expectancy in the US was 76.4 years as of early 2023, the lowest in over 20 years.
-How is this possible in the richest country in the world? The answer is capitalism. This is a system in which the working class, which produces all of the wealth, is systematically robbed of the vast portion of what it produces on the basis of private ownership of the means of production, production for profit, and the historically outmoded nation-state framework of economic life.
-The levels of oligarchic excess and parasitism in the US are particularly grotesque due to the complete political subordination of the working class to the ruling elite by means of the two-party system. The exclusion of the working class from political life is enforced by the union bureaucracy and its pseudo-left hangers on in such pro-Democratic groups as the Democratic Socialists of America (DSA).
-This is reflected in such facts as a federal minimum wage that remains at $7.25 an hour, not even sufficient to sustain human life. Meanwhile, the two-party monopoly spends a trillion dollars a year on war and the military and a trillion a year to service a national debt of $34 trillion and rising. The latter payouts, which directly enrich the banks and hedge funds, are on top of a combined $12 trillion doled out to rescue the financial elite in the Wall Street crises of 2008 and 2020.
-The millions of workers who voted for Trump did so as a protest against the undisguised indifference of Biden and Harris to the devastating impact of inflation and austerity, including the purging of 40 million people from the Medicaid rolls. They did not vote for dictatorship, the rounding up of immigrant workers in concentration camps policed by the military for summary deportation, the expansion of US imperialism's global war to China, an increase in Washington's support for the genocide in Gaza, or the destruction of millions more jobs and basic social services.
-They will be stunned and enraged by what is coming under Trump, and they will resist, massively and on a revolutionary scale. Leon Trotsky in his monumental History of the Russian Revolution devoted a chapter to "The Tzar and the Tzarina," in which he wrote of the cognitive blindness that seems to possess ruling classes on the eve of revolutionary upheavals. He wrote:
-To that historic flood which was rolling its billows each one closer to the gates of his palace, the last Romanov opposed only a dumb indifference. It seemed as though between his consciousness and his epoch there stood some transparent but absolutely impenetrable medium.
-Trotsky continued:
-The tzar had no need of narcotics: the fatal "dope" was in his blood. Its symptoms merely seemed especially striking on the background of those great events of war and domestic crisis which led up to the revolution.
-The American ruling class is facing a historical reckoning. The social force that alone can stop and reverse the slide to fascism and world war is the working class, in the US and internationally. It will fight, but it requires a scientific Marxist and internationalist perspective and strategy and the building of a new leadership, which can be accomplished only by the Trotskyist movement, the Socialist Equality Party and the International Committee of the Fourth International. To all those who see the dangers and want to fight, we say join the SEP and take up the fight for the political independence of the working class and socialism!

+The International Monetary Fund (IMF) announced on Saturday that it had reached a staff-level agreement under Sri Lanka’s $US3 billion bailout loan program with the new Janatha Vimukthi Peramuna (JVP)/National Peoples Power (NPP) government and thanked it for its “excellent collaboration.”
+The agreement was announced just two days after President Anura Dissanayake stated in his policy address to parliament on Thursday that the government was committed to implement the IMF demands in full, ditching his previous promise to “renegotiate” the terms.
+All of this is being done in the name of establishing “debt sustainability”—that is, creating conditions to resume loan repayments after the previous Gotabhaya Rajapakse government defaulted in April 2022. From 2028, the Sri Lankan government must resume the payment of $5 billion annually to the international loan sharks.
+In his inaugural speech to the parliament, Dissanayake made clear that his government had rapidly caved in to IMF demands on debt restructuring. The discussions, he said, were in the final stages and his government would reach a common understanding regarding bilateral debt without “debating” whether the plan was “good or bad”—effectively junking his criticisms of the previous President Ranil Wickremesinghe government.

poetry.lock CHANGED

@@ -5062,6 +5062,17 @@ files = [
 dev = ["Django (>=1.11)", "check-manifest", "colorama (<=0.4.1)", "coverage", "flake8", "nose2", "readme-renderer (<25.0)", "tox", "wheel", "zest.releaser[recommended]"]
 doc = ["Sphinx", "sphinx-rtd-theme"]
 [[package]]
 name = "shellingham"
 version = "1.5.4"
@@ -6194,4 +6205,4 @@ propcache = ">=0.2.0"
 [metadata]
 lock-version = "2.0"
 python-versions = "~3.10"
-content-hash = "9aec331b8bb3a245ef2f232c85343bc8a1114e5d03679e3309882d59d617e083"

 dev = ["Django (>=1.11)", "check-manifest", "colorama (<=0.4.1)", "coverage", "flake8", "nose2", "readme-renderer (<25.0)", "tox", "wheel", "zest.releaser[recommended]"]
 doc = ["Sphinx", "sphinx-rtd-theme"]
+[[package]]
+name = "sh"
+version = "2.1.0"
+description = "Python subprocess replacement"
+optional = false
+python-versions = "<4.0,>=3.8.1"
+files = [
+    {file = "sh-2.1.0-py3-none-any.whl", hash = "sha256:bf5e44178dd96a542126c2774e9b7ab1d89bfe0e2ef84d92e6d0ed7358d63d01"},
+    {file = "sh-2.1.0.tar.gz", hash = "sha256:7e27301c574bec8ca5bf6f211851357526455ee97cd27a7c4c6cc5e2375399cb"},
+]
 [[package]]
 name = "shellingham"
 version = "1.5.4"
 [metadata]
 lock-version = "2.0"
 python-versions = "~3.10"
+content-hash = "d1a5b230b811073006a1e63ea0853c11dc6e27ada05990ef3adc730ef1ed861c"

pyproject.toml CHANGED

@@ -51,6 +51,7 @@ wget = "^3.2"
 httpx = "^0.28.0"
 pandoc = "^2.4"
 pyyaml = "^6.0.2"
 [tool.poetry.group.ci.dependencies]
 gradio = "4.43.0"
@@ -159,6 +160,7 @@ module = [
     "pypresence",
     "resampy",
     "scipy.*",
     "sklearn.*",
     "soundfile",
     "stftpitchshift",

 httpx = "^0.28.0"
 pandoc = "^2.4"
 pyyaml = "^6.0.2"
+sh = "^2.1.0"
 [tool.poetry.group.ci.dependencies]
 gradio = "4.43.0"
     "pypresence",
     "resampy",
     "scipy.*",
+    "sh",
     "sklearn.*",
     "soundfile",
     "stftpitchshift",

requirements.txt CHANGED

@@ -2160,6 +2160,9 @@ segments==2.2.1 ; python_version >= "3.10" and python_version < "3.11" \
 semantic-version==2.10.0 ; python_version >= "3.10" and python_version < "3.11" \
     --hash=sha256:bdabb6d336998cbb378d4b9db3a4b56a1e3235701dc05ea2690d9a997ed5041c \
     --hash=sha256:de78a3b8e0feda74cabc54aab2da702113e33ac9d9eb9d2389bcf1f58b7d9177
 shellingham==1.5.4 ; python_version >= "3.10" and python_version < "3.11" and sys_platform != "emscripten" \
     --hash=sha256:7ecfff8f2fd72616f7481040475a65b2bf8af90a56c89140852d1120324e8686 \
     --hash=sha256:8dbca0739d487e5bd35ab3ca4b36e11c4078f3a234bfce294b0a0291363404de

 semantic-version==2.10.0 ; python_version >= "3.10" and python_version < "3.11" \
     --hash=sha256:bdabb6d336998cbb378d4b9db3a4b56a1e3235701dc05ea2690d9a997ed5041c \
     --hash=sha256:de78a3b8e0feda74cabc54aab2da702113e33ac9d9eb9d2389bcf1f58b7d9177
+sh==2.1.0 ; python_version >= "3.10" and python_version < "3.11" \
+    --hash=sha256:7e27301c574bec8ca5bf6f211851357526455ee97cd27a7c4c6cc5e2375399cb \
+    --hash=sha256:bf5e44178dd96a542126c2774e9b7ab1d89bfe0e2ef84d92e6d0ed7358d63d01
 shellingham==1.5.4 ; python_version >= "3.10" and python_version < "3.11" and sys_platform != "emscripten" \
     --hash=sha256:7ecfff8f2fd72616f7481040475a65b2bf8af90a56c89140852d1120324e8686 \
     --hash=sha256:8dbca0739d487e5bd35ab3ca4b36e11c4078f3a234bfce294b0a0291363404de

rvc/infer/infer.py CHANGED

@@ -241,20 +241,26 @@ class VoiceConverter:
             start_time = time.time()
             log.info(f"Converting audio '{audio_input_path}'...")
             if upscale_audio:
                 from audio_upscaler import upscale
                 upscale(audio_input_path, audio_input_path)
             audio = load_audio_infer(
                 audio_input_path,
                 16000,
                 **kwargs,
             )
             audio_max = np.abs(audio).max() / 0.95
             if audio_max > 1:
                 audio /= audio_max
             if not self.hubert_model or embedder_model != self.last_embedder_model:
                 self.load_hubert(embedder_model, embedder_model_custom)
                 self.last_embedder_model = embedder_model

             start_time = time.time()
             log.info(f"Converting audio '{audio_input_path}'...")
+            # Step 1: Upscale to 48kHz using Predict() model. Currently disabled
             if upscale_audio:
                 from audio_upscaler import upscale
                 upscale(audio_input_path, audio_input_path)
+            # Step 2: Load input audio file and downsample to 16kHz mono
             audio = load_audio_infer(
                 audio_input_path,
                 16000,
                 **kwargs,
             )
+            # Step 3: If the peak exceeds 0.95, scale the audio down so the peak is 0.95
             audio_max = np.abs(audio).max() / 0.95
             if audio_max > 1:
                 audio /= audio_max
+            # Step 4: Load hubert model
             if not self.hubert_model or embedder_model != self.last_embedder_model:
                 self.load_hubert(embedder_model, embedder_model_custom)
                 self.last_embedder_model = embedder_model
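
Note on the normalization step in the `rvc/infer/infer.py` hunk: dividing the absolute peak by 0.95 and rescaling only when the result is greater than 1 means quiet audio is left alone and loud audio is attenuated until its peak sits at 0.95. The sketch below restates that logic with plain Python lists instead of NumPy arrays. The function name `limit_peak` and the `ceiling` parameter are hypothetical and do not appear in the repository.

```python
def limit_peak(samples: list[float], ceiling: float = 0.95) -> list[float]:
    """Scale samples down so the absolute peak does not exceed `ceiling`.

    Mirrors the diff: scale = peak / 0.95, applied only when scale > 1.
    """
    if not samples:
        return []
    scale = max(abs(s) for s in samples) / ceiling
    if scale > 1:
        # Peak is above the ceiling: attenuate every sample by the same factor.
        return [s / scale for s in samples]
    # Peak is already at or below the ceiling: leave the signal unchanged.
    return list(samples)


loud = limit_peak([0.5, -1.9])
quiet = limit_peak([0.1, -0.2])
```

Here `loud` ends up with a peak magnitude of 0.95, while `quiet` is returned as it was.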

rvc/lib/tools/prerequisites_download.py CHANGED

@@ -1,11 +1,16 @@
 import os
 from concurrent.futures import ThreadPoolExecutor
 import requests
 from tqdm import tqdm
 from tts_service.voices import voice_manager
 url_base = "https://huggingface.co/IAHispano/Applio/resolve/main/Resources"
 pretraineds_v1_list = [
@@ -69,7 +74,7 @@ def get_file_size_if_missing(file_list: list[tuple[str, list[str]]]) -> int:
             destination_path = os.path.join(local_folder, file)
             if not os.path.exists(destination_path):
                 url = f"{url_base}/{remote_folder}{file}"
-                response = requests.head(url)
                 total_size += int(response.headers.get("content-length", 0))
     return total_size
@@ -85,10 +90,15 @@ def download_file(url: str, destination_path: str, global_bar: tqdm) -> None:
         os.makedirs(dir_name, exist_ok=True)
     response = requests.get(url, stream=True)
     block_size = 1024
     with open(destination_path, "wb") as file:
         for data in response.iter_content(block_size):
             file.write(data)
             global_bar.update(len(data))
 def download_mapping_files(file_mapping_list: list[tuple[str, list[str]]], global_bar: tqdm) -> None:
@@ -152,7 +162,7 @@ def calculate_total_size(
     return total_size
-def prequisites_download_pipeline(
     pretraineds_v1_f0: bool,
     pretraineds_v1_nof0: bool,
     pretraineds_v2_f0: bool,
@@ -163,6 +173,10 @@ def prequisites_download_pipeline(
     """
     Manage the download pipeline for different categories of files.
     """
     total_size = calculate_total_size(
         pretraineds_v1_f0_list if pretraineds_v1_f0 else [],
         pretraineds_v1_nof0_list if pretraineds_v1_nof0 else [],
@@ -173,7 +187,9 @@ def prequisites_download_pipeline(
     )
     if total_size > 0:
-        with tqdm(total=total_size, unit="iB", unit_scale=True, desc="Downloading all files") as global_bar:
             if models:
                 download_mapping_files(models_list, global_bar)
                 download_mapping_files(embedders_list, global_bar)
@@ -188,4 +204,4 @@ def prequisites_download_pipeline(
             if voices:
                 voice_manager.download_voice_files(global_bar)
     else:
-        pass

+import logging
 import os
+import sys
 from concurrent.futures import ThreadPoolExecutor
 import requests
 from tqdm import tqdm
+from tts_service.utils import env_bool
 from tts_service.voices import voice_manager
+log = logging.getLogger(__name__)
 url_base = "https://huggingface.co/IAHispano/Applio/resolve/main/Resources"
 pretraineds_v1_list = [
             destination_path = os.path.join(local_folder, file)
             if not os.path.exists(destination_path):
                 url = f"{url_base}/{remote_folder}{file}"
+                response = requests.head(url, allow_redirects=True)
                 total_size += int(response.headers.get("content-length", 0))
     return total_size
         os.makedirs(dir_name, exist_ok=True)
     response = requests.get(url, stream=True)
     block_size = 1024
+    total = 0
     with open(destination_path, "wb") as file:
         for data in response.iter_content(block_size):
             file.write(data)
             global_bar.update(len(data))
+            total += len(data)
+    global_bar.clear()
+    log.info(f"Downloaded {total:,} bytes to {destination_path}")
+    global_bar.display()
 def download_mapping_files(file_mapping_list: list[tuple[str, list[str]]], global_bar: tqdm) -> None:
     return total_size
+def prerequisites_download_pipeline(
     pretraineds_v1_f0: bool,
     pretraineds_v1_nof0: bool,
     pretraineds_v2_f0: bool,
     """
     Manage the download pipeline for different categories of files.
     """
+    if env_bool("OFFLINE", False):
+        log.info("Skipping download due to OFFLINE environment variable")
+        return
     total_size = calculate_total_size(
         pretraineds_v1_f0_list if pretraineds_v1_f0 else [],
         pretraineds_v1_nof0_list if pretraineds_v1_nof0 else [],
     )
     if total_size > 0:
+        log.info(f"Will download {total_size:,} bytes")
+        miniters = None if sys.stdout.isatty() else total_size
+        with tqdm(total=total_size, unit="iB", unit_scale=True, desc="Downloading...", miniters=miniters) as global_bar:
             if models:
                 download_mapping_files(models_list, global_bar)
                 download_mapping_files(embedders_list, global_bar)
             if voices:
                 voice_manager.download_voice_files(global_bar)
     else:
+        log.info("No files to download")
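
Note on the `OFFLINE` guard and the `miniters` choice in the hunk above: the implementation of `tts_service.utils.env_bool` is not part of this diff, so the helper below is only a guess at its behaviour (the accepted spellings are assumptions). `pick_miniters` is a hypothetical name for the inline expression that passes `None` to tqdm on a TTY and `total_size` otherwise, which appears intended to keep the bar from redrawing constantly when stdout is a log file rather than a terminal.

```python
import os
from typing import Optional

# Accepted spellings are an assumption; the real helper may differ.
_TRUE = {"1", "true", "yes", "on"}
_FALSE = {"0", "false", "no", "off", ""}


def env_bool(name: str, default: bool = False) -> bool:
    """Read a boolean flag from the environment, falling back to `default`."""
    raw = os.environ.get(name)
    if raw is None:
        return default
    value = raw.strip().lower()
    if value in _TRUE:
        return True
    if value in _FALSE:
        return False
    raise ValueError(f"{name}={raw!r} is not a recognised boolean")


def pick_miniters(total_size: int, is_tty: bool) -> Optional[int]:
    """Same expression as the diff: default behaviour on a TTY, sparse updates otherwise."""
    return None if is_tty else total_size


os.environ["OFFLINE"] = "yes"
offline = env_bool("OFFLINE", False)
del os.environ["OFFLINE"]
online = env_bool("OFFLINE", False)
```

With a guard of this kind at the top of `prerequisites_download_pipeline`, callers no longer need to thread the flag through the `voices` argument, which matches the `voices=True` change in `tts_service/app.py`.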

tabs/workflow/workflow.py CHANGED

@@ -1,14 +1,20 @@
 import gradio as gr
 from assets.i18n.i18n import I18nAuto
 from tts_service.docs import document_manager
 from tts_service.tts import run_tts_script
 from tts_service.utils import html_to_markdown, markdown_to_text
 i18n = I18nAuto()
 async def fetch_document(source: str) -> tuple[str, gr.Dataset]:
     doc = await document_manager.get_doc(source)
     if doc:
         overline = doc.get("overline")
@@ -29,6 +35,7 @@ async def fetch_document(source: str) -> tuple[str, gr.Dataset]:
         pieces.append(content)
         content = "\n\n".join(pieces)
         text = markdown_to_text(content)
         return content, text
     return "", ""
@@ -48,6 +55,12 @@ def workflow_tab():
                 interactive=True,
             )
             synthesize_button = gr.Button(i18n("Synthesize"))
             status = gr.Textbox(visible=False)
@@ -67,6 +80,6 @@ def workflow_tab():
     synthesize_button.click(
         fn=run_tts_script,
-        inputs=[text],
         outputs=[status, audio],
     )

+import logging
 import gradio as gr
 from assets.i18n.i18n import I18nAuto
 from tts_service.docs import document_manager
 from tts_service.tts import run_tts_script
 from tts_service.utils import html_to_markdown, markdown_to_text
+from tts_service.voices import voice_manager
 i18n = I18nAuto()
+log = logging.getLogger(__name__)
 async def fetch_document(source: str) -> tuple[str, gr.Dataset]:
+    log.info("Fetching document %s", source)
     doc = await document_manager.get_doc(source)
     if doc:
         overline = doc.get("overline")
         pieces.append(content)
         content = "\n\n".join(pieces)
         text = markdown_to_text(content)
+        log.info("Successfully fetched document %s: %s chars", source, len(text))
         return content, text
     return "", ""
                 interactive=True,
             )
+            voice = gr.Dropdown(
+                label=i18n("Voice"),
+                choices=voice_manager.voices.keys(),
+                value=voice_manager.voice_names[0],
+            )
             synthesize_button = gr.Button(i18n("Synthesize"))
             status = gr.Textbox(visible=False)
     synthesize_button.click(
         fn=run_tts_script,
+        inputs=[text, voice],
         outputs=[status, audio],
     )

tts_service/app.py CHANGED

@@ -1,34 +1,23 @@
-import logging
-import sys
-from pathlib import Path
 import gradio as gr
-import yaml
 import assets.installation_checker as installation_checker
 import assets.themes.loadThemes as loadThemes
 from assets.i18n.i18n import I18nAuto
-from rvc.lib.tools.prerequisites_download import prequisites_download_pipeline
 from tabs.plugins import plugins_core
 from tabs.workflow.workflow import workflow_tab
-from tts_service.utils import env_bool
-# Set up logging
-logging.getLogger("uvicorn").setLevel(logging.WARNING)
-logging.getLogger("httpx").setLevel(logging.WARNING)
-# Import Tabs
 plugins_core.check_new_folders()
 # Run prerequisites
-prequisites_download_pipeline(
     pretraineds_v1_f0=False,
     pretraineds_v1_nof0=False,
     pretraineds_v2_f0=True,
     pretraineds_v2_nof0=False,
     models=True,
-    voices=not env_bool("OFFLINE", False),
 )
 # Initialize i18n
@@ -49,26 +38,5 @@ with gr.Blocks(theme=my_applio, title="TTS Playground", css="footer{display:none
     gr.Markdown(i18n("Enter a page URL, click fetch and then synthesize"))
     with gr.Tab(i18n("Workflow")):
         workflow_tab()
-def setup_logging():
-    path = Path("logging.yml")
-    if not path.exists():
-        return
-    with path.open() as f:
-        from logging.config import dictConfig
-        dictConfig(yaml.safe_load(f))
-def launch_gradio():
-    setup_logging()
-    app.queue(status_update_rate=1).launch(
-        favicon_path="assets/ICON.ico",
-        share="--share" in sys.argv,
-        inbrowser="--open" in sys.argv,
-    )
-if __name__ == "__main__":
-    launch_gradio()

 import gradio as gr
 import assets.installation_checker as installation_checker
 import assets.themes.loadThemes as loadThemes
 from assets.i18n.i18n import I18nAuto
+from rvc.lib.tools.prerequisites_download import prerequisites_download_pipeline
 from tabs.plugins import plugins_core
+from tabs.tts.tts import tts_tab
 from tabs.workflow.workflow import workflow_tab
 plugins_core.check_new_folders()
 # Run prerequisites
+prerequisites_download_pipeline(
     pretraineds_v1_f0=False,
     pretraineds_v1_nof0=False,
     pretraineds_v2_f0=True,
     pretraineds_v2_nof0=False,
     models=True,
+    voices=True,
 )
 # Initialize i18n
     gr.Markdown(i18n("Enter a page URL, click fetch and then synthesize"))
     with gr.Tab(i18n("Workflow")):
         workflow_tab()
+    with gr.Tab(i18n("TTS")):
+        tts_tab()

tts_service/cli.py CHANGED

@@ -1,10 +1,10 @@
 import os
-from pathlib import Path
-import boto3
 import click
 from click_help_colors import HelpColorsGroup
 from dotenv import load_dotenv
 load_dotenv()
@@ -37,37 +37,15 @@ def service() -> None:
 @click.option("--prefix", "-p", default=lambda: os.environ["VOICES_KEY_PREFIX"], help="the prefix to use for the keys")
 @click.option("--delete", is_flag=True, help="delete extraneous files from dest")
 @click.option("--dry-run", "-n", is_flag=True, help="perform a trial run with no changes made")
-@click.argument("directory", type=click.Path(exists=True, file_okay=False, path_type=Path), nargs=1)
-def upload_voices(bucket: str, prefix: str, delete: bool, dry_run: bool, directory: Path) -> None:
     """Upload voices to the service"""
-    s3 = boto3.client("s3")
-    prefix = prefix.strip("/")
-    names = set()
-    for path in directory.glob("*.pth"):
-        names.add(path.name)
-        with path.open("rb") as file:
-            if dry_run:
-                click.echo(f"Would upload {path.name} to {bucket}/{prefix}")
-            else:
-                s3.put_object(Bucket=bucket, Key=f"{prefix}/{path.name}", Body=file)
-                # s3.upload_fileobj(file, bucket, f"{prefix}/{path.name}")
-    if not names:
-        raise click.ClickException(f"no voices found in directory {directory}")
-    deleted = 0
     if delete:
-        paginator = s3.get_paginator("list_objects_v2")
-        for page in paginator.paginate(Bucket=bucket, Prefix=prefix):
-            for obj in page["Contents"]:
-                key = obj["Key"]
-                if key.split("/")[-1] not in names:
-                    if dry_run:
-                        click.echo(f"Would delete {key}")
-                    else:
-                        s3.delete_object(Bucket=bucket, Key=key)
-                    deleted += 1
-    deleted_message = f", {deleted} deleted" if delete else ""
-    if not dry_run:
-        click.echo(f"{bucket}/{prefix}: {len(names)} voices uploaded{deleted_message}")
 if __name__ == "__main__":

 import os
+import sys
 import click
 from click_help_colors import HelpColorsGroup
 from dotenv import load_dotenv
+from sh import aws
 load_dotenv()
 @click.option("--prefix", "-p", default=lambda: os.environ["VOICES_KEY_PREFIX"], help="the prefix to use for the keys")
 @click.option("--delete", is_flag=True, help="delete extraneous files from dest")
 @click.option("--dry-run", "-n", is_flag=True, help="perform a trial run with no changes made")
+@click.argument("directory", type=click.Path(exists=True, file_okay=False), nargs=1)
+def upload_voices(bucket: str, prefix: str, delete: bool, dry_run: bool, directory: str) -> None:
     """Upload voices to the service"""
+    args = [directory, f"s3://{bucket}/{prefix}/"]
     if delete:
+        args.insert(0, "--delete")
+    if dry_run:
+        args.insert(0, "--dryrun")
+    aws.s3.sync(*args, _out=sys.stdout, _err=sys.stderr)
 if __name__ == "__main__":
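
Note on the `upload_voices` rewrite: the hand-written boto3 upload and delete loops are replaced by one `aws s3 sync` call, with `--delete` and `--dryrun` prepended according to the click flags. The argument assembly can be isolated into a pure function, which makes it checkable without invoking the AWS CLI. `build_sync_args` is a hypothetical name, and stripping slashes from `prefix` is an addition carried over from the removed boto3 code; the new code in the diff does not do it.

```python
def build_sync_args(
    directory: str,
    bucket: str,
    prefix: str,
    delete: bool = False,
    dry_run: bool = False,
) -> list[str]:
    """Assemble the argument list handed to `aws s3 sync`."""
    # Addition not present in the diff: avoid "s3://bucket//prefix//" style URLs.
    prefix = prefix.strip("/")
    args = [directory, f"s3://{bucket}/{prefix}/"]
    if delete:
        args.insert(0, "--delete")
    if dry_run:
        args.insert(0, "--dryrun")
    return args


args = build_sync_args("voices", "my-bucket", "/models/", delete=True, dry_run=True)
```

One behavioural difference worth keeping in mind: the removed code uploaded only `*.pth` files, whereas a plain `sync` of the directory transfers everything in it unless include or exclude filters are added.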

tts_service/start.py ADDED

@@ -0,0 +1,30 @@

+import sys
+from pathlib import Path
+import yaml
+def setup_logging():
+    path = Path("logging.yml")
+    if not path.exists():
+        return
+    with path.open() as f:
+        from logging.config import dictConfig
+        dictConfig(yaml.safe_load(f))
+def launch_gradio():
+    setup_logging()
+    from tts_service.app import app
+    app.queue(status_update_rate=1).launch(
+        favicon_path="assets/ICON.ico",
+        share="--share" in sys.argv,
+        inbrowser="--open" in sys.argv,
+    )
+if __name__ == "__main__":
+    launch_gradio()

tts_service/tts.py CHANGED

@@ -23,7 +23,7 @@ def import_voice_converter():
 # TTS
 async def run_tts_script(
     text: str,
-    voice_name: str = "male-1",
     rate: int = 0,
     progress=gr.Progress(),  # noqa: B008
 ) -> tuple[str, str]:
@@ -32,6 +32,8 @@ async def run_tts_script(
         progress(pct, msg)
         await asyncio.sleep(0)
     await update_progress(0, "Starting...")
     voice = voice_manager.voices[voice_name]
     format = "wav"
@@ -104,6 +106,7 @@ async def run_tts_script(
             callback=lambda pct: update_progress(0.5 + pct / 2, "Converting..."),
         )
     return "Text synthesized successfully.", str(output_rvc_path)

 # TTS
 async def run_tts_script(
     text: str,
+    voice_name: str,
     rate: int = 0,
     progress=gr.Progress(),  # noqa: B008
 ) -> tuple[str, str]:
         progress(pct, msg)
         await asyncio.sleep(0)
+    log.info("Synthesizing text (%s chars)", len(text))
     await update_progress(0, "Starting...")
     voice = voice_manager.voices[voice_name]
     format = "wav"
             callback=lambda pct: update_progress(0.5 + pct / 2, "Converting..."),
         )
+    log.info("Successfully synthesized text (%s chars)", len(text))
     return "Text synthesized successfully.", str(output_rvc_path)

tts_service/voices.py CHANGED

@@ -10,6 +10,8 @@ from tqdm import tqdm
 from .utils import data_dir, env_str
 @dataclass
 class S3VoiceObj:
@@ -89,6 +91,9 @@ class VoiceManager:
             destination_path = self.voices_dir / obj.name
             if not destination_path.exists() or destination_path.stat().st_size != obj.size:
                 self.s3.download_file(Bucket=self.bucket, Key=obj.key, Filename=destination_path, Callback=callback)
     @cached_property
     def tts_voices(self) -> dict[str, TTSVoice]:
@@ -103,7 +108,7 @@ class VoiceManager:
     @cached_property
     def voices(self) -> dict[str, Voice]:
         rv = {}
-        for path in self.voices_dir.glob("*.json"):
             voice = Voice.model_validate_json(path.read_bytes())
             model_path = self.voices_dir / f"{voice.model}"
             if not model_path.exists():

 from .utils import data_dir, env_str
+log = logging.getLogger(__name__)
 @dataclass
 class S3VoiceObj:
             destination_path = self.voices_dir / obj.name
             if not destination_path.exists() or destination_path.stat().st_size != obj.size:
                 self.s3.download_file(Bucket=self.bucket, Key=obj.key, Filename=destination_path, Callback=callback)
+                progress_bar.clear()
+                log.info(f"Downloaded {obj.size:,} bytes to {destination_path}")
+                progress_bar.display()
     @cached_property
     def tts_voices(self) -> dict[str, TTSVoice]:
     @cached_property
     def voices(self) -> dict[str, Voice]:
         rv = {}
+        for path in sorted(self.voices_dir.glob("*.json")):
             voice = Voice.model_validate_json(path.read_bytes())
             model_path = self.voices_dir / f"{voice.model}"
             if not model_path.exists():
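
Note on the `sorted(...)` change in `voices`: `Path.glob` makes no promise about ordering, so the first entry of the resulting dict could differ between machines. Since `tabs/workflow/workflow.py` now uses `voice_manager.voice_names[0]` as the dropdown default, sorting the paths gives a stable default. The sketch below shows the effect on a temporary directory. Using the file stem as the voice name is an assumption made for illustration; the real code reads the name from the validated `Voice` model.

```python
import tempfile
from pathlib import Path


def voice_names(voices_dir: Path) -> list[str]:
    """Return voice names in a deterministic order (file stem stands in for the name)."""
    return [path.stem for path in sorted(voices_dir.glob("*.json"))]


with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    # Create the files in a deliberately unsorted order.
    for name in ("male-2", "female-1", "male-1"):
        (root / f"{name}.json").write_text("{}")
    names = voice_names(root)
```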