Generate audio from text using voice synthesis
Generate speech from text using a reference audio sample