Generate speech from text using a reference audio sample
Generate audio from text using voice synthesis