metadata
language:
- en
base_model:
- coqui/XTTS-v2
pipeline_tag: text-to-speech
tags:
- XTTS
Instructions: Just extract the zipped files into your XTTS-WebUI main directory. Also I recommend that you disable DeepSpeed. While it does cut output times in half, it greatly reduces the output quality.
Version: 0.1.1 Pre-release. About this version: This is the first test build of a model that was built on a manually curated dataset. The dataset was initially created with whisper in step one of XTTS-Finetune. The clips were then manually edited to fix the issue of the clips being cut to short. Also the dataset's metadata was corrected for spelling errors. Dataset length: 3:49