Update README.md
README.md
CHANGED
@@ -33,6 +33,9 @@ Part of a [personal project](https://github.com/Respaired/Project-Kanade), focus
 
 - Join Shoukan lab's Discord server, a comfy place I frequently visit -> [![Discord](https://img.shields.io/discord/1197679063150637117?logo=discord&logoColor=white&label=Join%20our%20Community)](https://discord.gg/JrPSzdcM)
 
+GitHub repo:
+[![GitHub](https://img.shields.io/badge/github-%23121011.svg?style=for-the-badge&logo=github&logoColor=white)](https://github.com/Respaired/Tsukasa-Speech)
+
 ## What is this?
 
 *Note*: This model only supports the Japanese language, but you can feed it Romaji if you use the Gradio demo.
@@ -70,6 +73,7 @@ Brought to you by:
 - [Buttercream](https://github.com/korakoe)
 
 Special thanks to Yinghao Aaron Li, the author of StyleTTS, on which this work is based. <br> He is one of the most talented engineers I've ever seen in this field.
+Thanks also to Karesto and Raven for their help in debugging some of the scripts. Wonderful people.
 
 ## Why does it matter?
 
@@ -173,7 +177,11 @@ pip install -r requirements.txt
 
 ***There is a wide gap between English and other languages, so I mostly focus on non-English projects. But the good folks at Shoukan labs are trying to train a multilingual model with English included. However, if I ever do one myself, it'll be focused on something specific (let's say accents).***
 
-8.
+8. Can I use the DDP on the original StyleTTS without any of your modifications?
+
+***Sure! But you need to make some changes. Replace the xLSTM's pre-projection at around line 922 with the lines from the original script. You also have to modify the prosody encoder and put the contents of F0Ntrain into the forward function of the prosody encoder itself.***
+
+9. Any questions?
 ```email
 saoshiant@protonmail.com
 ```