metadata
title: WavJourney
emoji: π₯
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 3.40.1
app_file: app.py
pinned: false
license: cc-by-nc-nd-4.0
Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference
How to run WavJourney?
- Install the environment by following the bash scripts in EnvsSetup/
- Start the API services; the service logs are written to the logs/ folder
python scripts/start_services.py
- Run the AudioJourney client; the generated scripts and audio are written to output/[datetime]_[instruction text]/
conda activate AudioJourney
python audiojourney_cli.py -f --instruction "News channel BBC broadcast about Trump playing street fighter 6 against Biden"
- Kill the API services
python scripts/kill_services.py
- Start the UI
sh scripts/start_ui.sh
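The start, run, and kill steps above can be sketched as a small Python wrapper that always stops the services, even if the client fails. This is a minimal sketch, not part of WavJourney: `build_steps` and `run_session` are hypothetical helper names, and it assumes that the AudioJourney conda environment is already active, that the commands are run from the project root, and that scripts/start_services.py returns once the services are launched.

```python
import subprocess


def build_steps(instruction):
    """Return the documented commands as argv lists, in run order."""
    return [
        ["python", "scripts/start_services.py"],
        ["python", "audiojourney_cli.py", "-f", "--instruction", instruction],
        ["python", "scripts/kill_services.py"],
    ]


def run_session(instruction, runner=subprocess.run):
    """Start the services, run the client, then kill the services.

    Assumption: start_services.py does not block, as the step order
    in the instructions above suggests.
    """
    start, client, kill = build_steps(instruction)
    runner(start, check=True)
    try:
        runner(client, check=True)
    finally:
        # Stop the services whether or not the client succeeded.
        runner(kill, check=True)
```

Calling `run_session("some instruction text")` from the project root would run the three commands in sequence; the `runner` parameter exists only so the order can be checked without launching anything.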
Voice Presets
You can add voice presets to WavJourney to customize the voice actors. Simply provide the voice id, the description and a sample wav file, and WavJourney will pick the voice automatically based on the audio script.
Predefined system voice presets are in data/voice_presets, whereas session voice presets are in each session's individual folder. See the example below:
- project_folder
  - data
    - voice_presets <-- system voice presets
      - metadata.json <-- system voice preset metadata
      - npz
  - output
    - sessions
      - session_1
        - voice_presets <-- session voice presets
          - metadata.json <-- session voice preset metadata
          - npz
      - session_2
        - ...
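The layout above can be expressed as a small path helper. This is a sketch under stated assumptions: `voice_preset_dir` and `preset_files` are hypothetical names, an empty session id is taken to mean the system presets, and npz is assumed to be a folder next to metadata.json. Nothing here reads or interprets metadata.json, since its schema is not described in this document.

```python
from pathlib import Path


def voice_preset_dir(project_root, session_id=""):
    """Return the voice_presets folder for the system or for one session."""
    root = Path(project_root)
    if session_id == "":
        # System presets live under data/voice_presets.
        return root / "data" / "voice_presets"
    # Session presets live inside that session's own folder.
    return root / "output" / "sessions" / session_id / "voice_presets"


def preset_files(project_root, session_id=""):
    """Return the two entries shown in the tree: metadata.json and npz."""
    base = voice_preset_dir(project_root, session_id)
    return {"metadata": base / "metadata.json", "npz": base / "npz"}
```

For example, `voice_preset_dir("project_folder", "session_1")` resolves to project_folder/output/sessions/session_1/voice_presets.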
Add voice to system voice presets via command line
It's recommended to manage voice presets via the UI. However, if you want to add a voice to the voice presets via the command line, run the script below:
python add_voice_preset.py --id "id" --desc "description" --wav-path path/to/wav --session-id session-id
If session-id is set to '', the voice is added to the system voice presets.
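When several voices need to be added, the same command can be assembled from Python. The sketch below only builds and runs the command line documented above; `add_voice_preset_cmd` and `add_voice_preset` are hypothetical wrapper names, and the sketch assumes it is run from the folder that contains add_voice_preset.py.

```python
import subprocess


def add_voice_preset_cmd(voice_id, desc, wav_path, session_id=""):
    """Build the argv for add_voice_preset.py.

    An empty session_id targets the system voice presets.
    """
    return [
        "python", "add_voice_preset.py",
        "--id", voice_id,
        "--desc", desc,
        "--wav-path", str(wav_path),
        "--session-id", session_id,
    ]


def add_voice_preset(voice_id, desc, wav_path, session_id=""):
    """Run the script and raise if it exits with a non-zero status."""
    cmd = add_voice_preset_cmd(voice_id, desc, wav_path, session_id)
    subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems when the description contains spaces or quotes.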