ai-pronunciation-trainer / app_description.md
alessandro trinca tornidor
feat: port whisper and faster-whisper support from https://github.com/Thiagohgl/ai-pronunciation-trainer
85b7206

A newer version of the Gradio SDK is available: 5.25.2

Upgrade

Models and variables

Right now this tool uses:

  • {model_name} as STT (speech-to-text) model; other supported models are: {models_names_urls_list}
  • {sample_rate_start} as input samplerate value (from empirical tests the best sample rate value is 48000)
  • {sample_rate_resample} as resampled samplerate value
  • {samplerate_tts} as TTS (text-to-speech) samplerate value