metadata
title: WavJourney
emoji: 🔥
colorFrom: blue
colorTo: purple
sdk: gradio
sdk_version: 3.40.1
app_file: app.py
pinned: false
license: cc-by-nc-nd-4.0

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

How to run WavJourney?

  1. Set up the environment by following the bash scripts in EnvsSetup/
  2. Start the API services. The service logs are written to the logs/ folder.
python scripts/start_services.py
  3. Run the AudioJourney client. The generated scripts and audio are saved in output/[datetime]_[instruction text]/
conda activate AudioJourney
python audiojourney_cli.py -f --instruction "News channel BBC broadcast about Trump playing street fighter 6 against Biden"
  4. Kill the API services
python scripts/kill_services.py
  5. Start the UI
sh scripts/start_ui.sh
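
The start, run, and kill commands above can be chained from a single Python driver. This is a minimal sketch, not part of the repository: the helper names `build_cli_command` and `run_pipeline` are hypothetical. It assumes the driver is launched from the project root inside the already activated AudioJourney conda environment, and that `scripts/start_services.py` returns once the services are running in the background.

```python
import subprocess
import sys


def build_cli_command(instruction):
    """Return the argv for the client call shown above (hypothetical helper)."""
    return [sys.executable, "audiojourney_cli.py", "-f", "--instruction", instruction]


def run_pipeline(instruction, runner=subprocess.run):
    """Start the API services, run the client, then stop the services.

    Assumption: start_services.py returns after launching the services;
    if it blocks instead, it would need to run in a separate process.
    """
    runner([sys.executable, "scripts/start_services.py"], check=True)
    try:
        runner(build_cli_command(instruction), check=True)
    finally:
        # Stop the API services even if the client call fails.
        runner([sys.executable, "scripts/kill_services.py"], check=True)
```

Passing the instruction as a single list element avoids shell quoting problems with spaces or quotes in the instruction text. The `runner` parameter exists only so the sequence can be checked without launching anything.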

Voice Presets

You can add voice presets to WavJourney to customize the voice actors. Simply provide the voice id, the description and a sample wav file, and WavJourney will pick the voice automatically based on the audio script.

Predefined system voice presets are in data/voice_presets, whereas session voice presets are in each session's individual folder. See the example below:

  • 📂 project_folder
    • 📂 data
      • 📂 voice_presets <-- system voice presets
        • 📄 metadata.json <-- system voice preset metadata
        • 📂 npz
    • 📂 output
      • 📂 sessions
        • 📂 session_1
          • 📂 voice_presets <-- session voice presets
            • 📄 metadata.json <-- session voice preset metadata
            • 📂 npz
        • 📂 session_2
        • ...
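
The layout above can be expressed as a small path helper. This is only a sketch based on the folder tree shown here: the function names `voice_preset_dir` and `metadata_path` are hypothetical and are not taken from the WavJourney code. An empty session id is treated as "system presets", following the convention used by the command-line script in this README.

```python
from pathlib import Path


def voice_preset_dir(project_folder, session_id=""):
    """Return the voice_presets folder for a session.

    An empty session_id selects the system presets under data/.
    """
    root = Path(project_folder)
    if session_id == "":
        return root / "data" / "voice_presets"
    return root / "output" / "sessions" / session_id / "voice_presets"


def metadata_path(project_folder, session_id=""):
    """Return the metadata.json path that sits next to the npz folder."""
    return voice_preset_dir(project_folder, session_id) / "metadata.json"
```

For example, `metadata_path("project_folder", "session_1")` points at `project_folder/output/sessions/session_1/voice_presets/metadata.json`.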

Add voice to system voice presets via command line

We recommend managing voice presets through the UI. However, if you want to add a voice to the voice presets from the command line, run the script below:

python add_voice_preset.py --id "id" --desc "description" --wav-path path/to/wav --session-id session-id

If session-id is set to '', the voice is added to the system voice presets.
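
When registering voices from another Python script, the same command can be assembled as an argument list. This is a hedged sketch: `add_voice_preset_command` is a hypothetical helper, and it uses only the flags shown in the command above. It assumes it is run from the folder that contains `add_voice_preset.py`.

```python
import subprocess
import sys


def add_voice_preset_command(voice_id, desc, wav_path, session_id=""):
    """Build the argv for add_voice_preset.py.

    The default empty session_id targets the system voice presets.
    """
    return [
        sys.executable, "add_voice_preset.py",
        "--id", voice_id,
        "--desc", desc,
        "--wav-path", str(wav_path),
        "--session-id", session_id,
    ]


def add_voice_preset(voice_id, desc, wav_path, session_id=""):
    """Run the script and raise CalledProcessError if it exits with an error."""
    subprocess.run(
        add_voice_preset_command(voice_id, desc, wav_path, session_id),
        check=True,
    )
```

Building the command as a list passes the empty session id as a real empty argument, which sidesteps differences in how shells treat `''`.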