Generate realistic voice audio from text and audio prompts
Execute commands based on environment variables