---
license: cc-by-nc-4.0
datasets:
- mozilla-foundation/common_voice_17_0
language:
- tr
base_model:
- SWivid/F5-TTS
pipeline_tag: text-to-speech
tags:
- audio
- tts
- turkish
---

For inference, use the `.safetensors` checkpoint together with the vocabulary file:

```
f5_tts_turkish_1000000.safetensors
vocab.txt
```
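
As a rough sketch of how these two files might be passed to the upstream F5-TTS command-line tool, the snippet below only assembles the command and prints it. The flag names (`--ckpt_file`, `--vocab_file`, `--ref_audio`, `--ref_text`, `--gen_text`, `--nfe_step`) are assumed to match the `f5-tts_infer-cli` entry point of the installed F5-TTS version, so check them against `f5-tts_infer-cli --help` before running. Depending on the version, a model or config flag may also be needed. The reference audio path and both Turkish sentences are hypothetical placeholders.

```python
import shlex


def build_infer_command(ckpt_file, vocab_file, ref_audio, ref_text, gen_text, nfe_step=None):
    """Assemble an argv list for the F5-TTS inference CLI.

    Flag names are assumptions about the upstream CLI, not taken from this card.
    """
    cmd = [
        "f5-tts_infer-cli",
        "--ckpt_file", ckpt_file,    # fine-tuned Turkish checkpoint
        "--vocab_file", vocab_file,  # vocabulary shipped with the checkpoint
        "--ref_audio", ref_audio,    # short clip of the voice to imitate
        "--ref_text", ref_text,      # transcript of the reference clip
        "--gen_text", gen_text,      # text to synthesize
    ]
    if nfe_step is not None:
        cmd += ["--nfe_step", str(nfe_step)]
    return cmd


command = build_infer_command(
    ckpt_file="f5_tts_turkish_1000000.safetensors",
    vocab_file="vocab.txt",
    ref_audio="reference.wav",                      # hypothetical path
    ref_text="Referans kaydında söylenen cümle.",   # hypothetical transcript
    gen_text="Merhaba, bu bir deneme cümlesidir.",  # hypothetical input text
    nfe_step=64,
)

# Print a shell-safe command line; run it manually or via subprocess.run(command).
print(shlex.join(command))
```

Building the command as a list rather than a single string keeps the Turkish text and any spaces in file paths correctly quoted.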

GitHub: https://github.com/SWivid/F5-TTS

Paper: [F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching](https://huggingface.co/papers/2410.06885)

## Samples

Ref (reference audio): https://voca.ro/1fxdnqkzN4wR

Gen (generated audio): https://voca.ro/1nM46muVinRS

> **_NOTE:_** Setting the number of NFE (function evaluation) steps to 64 can produce better-quality audio, at the cost of slower generation.
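
To give some intuition for why more NFE steps can help: in flow-matching models, sampling integrates an ODE, and the NFE count is how many times the model is evaluated along the way. The toy sketch below is not the F5-TTS sampler. It uses a plain Euler solver on a hypothetical one-dimensional velocity field with a known exact solution, only to show the integration error shrinking as the step count grows.

```python
import math


def euler_integrate(velocity, x0, num_steps):
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 with fixed Euler steps."""
    x = x0
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = i * dt
        x = x + dt * velocity(x, t)  # one function evaluation per step
    return x


def toy_velocity(x, t):
    """Hypothetical stand-in for a learned velocity field: dx/dt = -x."""
    return -x


# Exact solution at t=1 for x(0) = 1 is exp(-1).
exact = math.exp(-1.0)
errors = {
    n: abs(euler_integrate(toy_velocity, 1.0, n) - exact)
    for n in (16, 32, 64)
}

for n, err in errors.items():
    print(f"steps={n:3d}  abs error={err:.6f}")
```

Each doubling of the step count also doubles the number of model evaluations, which is why a higher NFE setting trades speed for accuracy.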