Changelog
v2.4.1 (2024-03-16)
Changes to how the bat files install and update (no other changes)
For various reasons, the install and update bat files have changed. The previous ones did not use Git, which made updates between versions hard to handle; the new ones download and use PortableGit on machines that do not have Git.
As a result, anyone on Windows who installed by double-clicking a bat file must reinstall. We apologize for the inconvenience.
Installation
(The installation flow is unchanged, but the bat files are different, so be sure to download the new zip.)
- Download sbv2.zip and extract it.
- If you have a GPU, double-click Install-Style-Bert-VITS2.bat.
- If you do not have a GPU, double-click Install-Style-Bert-VITS2-CPU.bat. The CPU version cannot train, but speech synthesis and merging work.
Update procedure
Updating from an earlier version
You must delete the existing environment and install from scratch. To migrate:
- Back up the Data and model_assets folders, which may contain important data.
- Install Style-Bert-VITS2 in a new location by following the installation steps above.
- When installation finishes, copy the backed-up Data and model_assets folders into the new Style-Bert-VITS2 folder.
- The previously installed folder (including its bat files) can then be deleted.
Future updates
From now on, double-click Update-Style-Bert-VITS2.bat inside the new installation. The old Update-Style-Bert-VITS2.bat and similar files no longer work.
v2.4.0 (2024-03-15)
Large-scale refactoring, a worker process for Japanese text processing, and new features. Note that the dataset creation, training, speech synthesis, merge, and style WebUIs are now all unified into app.py (App.bat).
Update procedure
- If you are updating from a version earlier than 2.3 (before the dictionary and editor were added), download Update-to-Dict-Editor.bat, place it in the folder that contains the Style-Bert-VITS2 folder (where the install bat files were), and double-click it.
- Otherwise, simply update with your existing Update-Style-Bert-VITS2.bat.
- The update moves many files and makes others unnecessary. To remove them, save Clean.bat in the same location as Update-Style-Bert-VITS2.bat and run it.
Internal improvements
- A large refactoring pull request by tsukumijima greatly tidied the internal code, improved readability, and turned the project into a library. Thank you, tsukumijima, for the enormous effort!
- The library installs with pip install style-bert-vits2 and exposes the speech synthesis functionality (see /library.ipynb for a usage example).
- Prompted by that pull request, much more code was refactored and type annotations were added.
- pyopenjtalk, which handles Japanese text processing, now runs in a separate process reached over a socket, so running several training or synthesis instances at once no longer causes dictionary conflict errors. This was a PR by kale4eat. Thank you!
Bug fixes
- As noted above, starting two or more programs that use Japanese text processing (for example speech synthesis and training preprocessing) no longer raises an error. Once added, user dictionary entries are applied everywhere.
- When audio files were in subfolders rather than directly under the raw folder, the wavs folder kept the same structure and no longer matched the transcription file. wav files are now always saved directly under the wavs folder.
- Fixed a bug where slicing produced broken file names when the source file name contained a period (.).
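The period bug in the last item is a common pitfall when an extension is removed by splitting on the first period. The sketch below contrasts that with `pathlib.Path.stem`. The function names and the `{stem}-{index}.wav` naming scheme are hypothetical and used only for illustration; the project's actual slicing code may differ.

```python
from pathlib import Path


def sliced_name_naive(source: str, index: int) -> str:
    # Buggy approach: everything after the FIRST period is discarded.
    stem = source.split(".")[0]
    return f"{stem}-{index}.wav"


def sliced_name(source: str, index: int) -> str:
    # Path.stem strips only the final suffix, so inner periods survive.
    stem = Path(source).stem
    return f"{stem}-{index}.wav"


print(sliced_name_naive("take.01.final.mp3", 0))
print(sliced_name("take.01.final.mp3", 0))
```

With a source file named take.01.final.mp3, the naive version collapses every slice to a name starting with take, while the stem-based version keeps the full base name.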
Feature improvements and additions
- The various WebUIs are unified into one (app.py, App.bat).
- The changes below, plus minor UI and description improvements.
Dataset creation
- Slicing is faster: it is now multithreaded, which helps when there are many source files. Source files may also be in formats other than wav, such as mp3 and ogg.
- Added an option to include the sliced start and end positions in each file name (PR by aka7774. Thank you!).
- Transcription is faster, and there is a new option to use the Hugging Face Whisper models. Raising the batch size greatly increases speed at the cost of more VRAM.
Training
- Source audio files for training (those in Data/{model_name}/raw) may now be in formats other than wav, such as mp3 and ogg; they are converted to wav automatically during preprocessing. As before, each file should ideally be about 2 to 12 seconds long.
Speech synthesis
- The pitch and the intonation range of generated speech can now be adjusted during synthesis, at the cost of slightly lower audio quality. This works from both App.bat and Editor.bat.
- Editor.bat can now select the speaker for multi-speaker models.
- In Editor.bat, pasting text that contains line breaks automatically adds rows, and the up and down arrow keys add rows and move between them (this had already shipped on the editor side).
- Editor.bat has a new menu item to reload the model list.
API
- server_fastapi.py no longer tries to load every model file at startup. A model is now loaded the first time synthesis is requested for it, matching the behavior of synthesis without the API.
- The /voice synthesis endpoint of server_fastapi.py now accepts POST in addition to GET. GET appears to have many limitations, so POST is recommended.
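The load-on-first-request behavior described for server_fastapi.py can be pictured with a small cache. This is only a sketch of the general pattern, not the server's actual code: `LazyModelCache`, `fake_loader`, and the model names are hypothetical stand-ins.

```python
from typing import Callable, Dict, List


class LazyModelCache:
    """Load each model the first time it is requested, then reuse it."""

    def __init__(self, loader: Callable[[str], object]) -> None:
        # Hypothetical loader: turns a model name into a loaded model.
        self._loader = loader
        self._models: Dict[str, object] = {}

    def get(self, name: str) -> object:
        if name not in self._models:  # first request for this model
            self._models[name] = self._loader(name)
        return self._models[name]


load_calls: List[str] = []


def fake_loader(name: str) -> str:
    # Stand-in for an expensive load from disk.
    load_calls.append(name)
    return f"model:{name}"


cache = LazyModelCache(fake_loader)
cache.get("model_a")
cache.get("model_a")  # served from the cache, no second load
print(load_calls)
```

Startup stays cheap because nothing is loaded until a request names a model, and repeated requests for the same model do not trigger another load.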
CLI
- preprocess_text.py has a new --correct_path option that automatically rewrites the audio file paths in the transcription file to the correct Data/{model_name}/wavs/ location (the WebUI already behaved this way).
- Other CLI options were added to match the dataset creation features above (see CLI.md for details).
v2.3.1 (2024-02-27)
Bug fixes
- Fixed the Colab training notebook, which did not run.
- App.bat and server_fastapi.py still raised errors on unreadable characters, so inference now always ignores unreadable characters and reads the rest.
Improvements
- When a reading cannot be obtained, text preprocessing previously always stopped with an error at the end. There are now two additional options: continue without using the files whose readings failed, or ignore the unreadable characters and use the files for training anyway.
- Added spherical linear interpolation as a merge method alongside linear interpolation (PR by @frodo821. Thank you!).
- Updated the .dockerignore used for deployment.
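For readers unfamiliar with the two merge methods, the sketch below compares linear interpolation with the standard spherical linear interpolation (slerp) formula on plain vectors. How the project applies this to model weights is not stated in this changelog, so the flat-list representation and the function names are assumptions.

```python
import math
from typing import List


def lerp(a: List[float], b: List[float], t: float) -> List[float]:
    # Straight-line blend between a and b.
    return [(1.0 - t) * x + t * y for x, y in zip(a, b)]


def slerp(a: List[float], b: List[float], t: float) -> List[float]:
    # Blend along the arc between a and b instead of the chord.
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return lerp(a, b, t)
    dot = sum(x * y for x, y in zip(a, b))
    cos_omega = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
    omega = math.acos(cos_omega)
    sin_omega = math.sin(omega)
    if sin_omega < 1e-6:
        # Nearly parallel or opposite vectors: fall back to lerp.
        return lerp(a, b, t)
    weight_a = math.sin((1.0 - t) * omega) / sin_omega
    weight_b = math.sin(t * omega) / sin_omega
    return [weight_a * x + weight_b * y for x, y in zip(a, b)]


midpoint_linear = lerp([1.0, 0.0], [0.0, 1.0], 0.5)
midpoint_spherical = slerp([1.0, 0.0], [0.0, 1.0], 0.5)
print(math.hypot(*midpoint_linear), math.hypot(*midpoint_spherical))
```

For two unit vectors at a right angle, the linear midpoint shrinks in length while the spherical midpoint keeps unit length, which is the usual motivation for slerp.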
Update procedure
- If you are updating from a version earlier than 2.3, download Update-to-Dict-Editor.bat, place it in the folder that contains the Style-Bert-VITS2 folder (where the install bat files were), and double-click it.
- If you are updating from 2.3, simply update with your existing Update-Style-Bert-VITS2.bat.
v2.3 (2024-02-26)
Major changes
This release makes several large changes, so updating again requires a dedicated procedure. Follow the instructions below.
User dictionary
You can add proper nouns to a dictionary in advance, and they are applied when readings are obtained during both training and speech synthesis. Add and edit entries through the editor described next. Alternatively, if you already have a dictionary in OpenJTalk csv format, you can overwrite or append to dict_data/default.csv directly.
Dictionaries that may be useful (check the licenses yourself; please let me know of other good ones):
The dictionary implementation comes from VOICEVOX Editor, as noted in the README inside, and that part of the code is licensed under LGPL-3.0.
Speech synthesis editor
🤗 An online demo is available here.
Added an editor dedicated to speech synthesis. Besides what the existing WebUI could do, it offers the following (in other words, it imitates the editors of existing Japanese speech synthesis software):
- Write a script while changing the character and settings line by line, generate it all at once, and save or load the script.
- Adjust accents through an easy-to-follow GUI.
- Add and edit words in the user dictionary.
Launch it by double-clicking Editor.bat or by running python server_editor.py --inbrowser. The editor itself lives in a separate repository. I am new to front-end development, so pull requests and suggestions are welcome.
Bug fixes
- Fixed a bug where, in certain situations, readings were not obtained correctly and a list index out of range error occurred.
- Fixed a bug in preprocessing where a malformed line in the transcription file caused everything after it in the file to be lost.
- faster-whisper had a major version bump to 1.0.0 and (for now) became much worse, so its version is pinned to 0.10.1.
Improvements
- When text preprocessing fails to obtain a reading or hits a similar problem, it no longer stops; the failing locations are saved to text_error.log.
- During speech synthesis, unreadable characters no longer raise an error; that part is skipped and the rest is read aloud (training still raises an error).
- Added preprocess_all.py, which runs all preprocessing, so that preprocessing and training are easy from the command line (see CLI.md for details).
- Added an option to upload training results automatically to your own Hugging Face repository. Pass it as a command-line argument such as --repo_id username/my_model (see CLI.md for details). 🤗 offers unlimited storage, which is handy for training in the cloud.
- Added an option to freeze the decoder during training. It might improve quality.
- initialize.py has new --dataset_root and --assets_root arguments, so configs/paths.yml can be set at that point.
Other
- Added a guide to training on Paperspace and a Dockerfile usable as a Paperspace image.
- Added documentation on running each step from the CLI.
- Added a Dockerfile for deploying the speech synthesis editor to Hugging Face Spaces.
Update procedure
Download Update-to-Dict-Editor.bat, place it in the folder that contains the Style-Bert-VITS2 folder (where the install bat files were), and double-click it. To update manually, run the following steps:
git pull
venv\Scripts\activate
pip uninstall pyopenjtalk-prebuilt
pip install -U -r requirements.txt
# python initialize.py # Run this if you are updating from 1.x
python server_editor.py --inbrowser
Fresh installation
Download this zip, extract it, and double-click Install-Style-Bert-VITS2.bat.
v2.2 (2024-02-09)
Changes and new features
- The bfloat16 option appears to have only downsides, so training now always runs with it off.
- The default batch size changed from 4 to 2. If training is slow, try lowering the batch size; if you have VRAM to spare, raise it. With JP-Extra, approximate VRAM use per batch size is 1: 6GB, 2: 8GB, 3: 10GB, 4: 12GB.
- The number of validation samples during training now defaults to 0 and can be set in the training WebUI.
- The Tensorboard logging interval can be set in the training WebUI.
- The UI theme can be set with GRADIO_THEME in common/constants.py.
Bug fixes
- Fixed an error during training with JP-Extra when the batch size was 1.
- Fixed an error in training and speech synthesis when symbols such as exclamation marks repeat, as in 「こんにちは!?!?!?!?」.
- Dash and hyphen variants such as — (em dash, U+2014) and ― (quotation dash, U+2015) were normalized to - (the ordinary half-width hyphen) for some variants but not others. All of them are now normalized.
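The dash normalization in the last item can be sketched as a single character-class substitution. The set of variants below is an assumption chosen for illustration (only U+2014 and U+2015 are named in this changelog), and `normalize_dashes` is a hypothetical name, not the project's function.

```python
import re

# Assumed set of dash-like characters; the project's actual list may differ.
# U+2010..U+2015: hyphen, non-breaking hyphen, figure dash, en dash,
# em dash, quotation dash. U+2212: minus sign. U+FF0D: full-width hyphen.
DASH_VARIANTS = "\u2010\u2011\u2012\u2013\u2014\u2015\u2212\uFF0D"
_DASH_RE = re.compile(f"[{DASH_VARIANTS}]")


def normalize_dashes(text: str) -> str:
    # Map every variant to the ordinary half-width hyphen.
    return _DASH_RE.sub("-", text)


print(normalize_dashes("a\u2014b\u2015c"))
```

Keeping the variants in one table means a newly discovered dash character is handled by adding one code point rather than another special case.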
v2.1 (2024-02-07)
Changes
- Training no longer uses the bfloat16 option by default (it appears to make training diverge or lower quality in some cases).
- Attempted to reduce memory use during training.
Bug fixes and improvements
- Tensorboard logs can now be viewed from the training WebUI.
- Fixed an error in speech synthesis (and its API) that occurred when synthesis requests selecting different speakers arrived at the same time.
- Merging models now saves the recipe to a recipe.json file.
- Minor description improvements, such as noting that "split by line break and generate" carries more emotion.
- In text such as 「ーーそれは面白い」 or 「なるほど。ーーーそういうことか。」, a long vowel mark that does not follow a vowel is assumed to be a dash ― mistyped as the long vowel mark ー, and is now processed as a dash.
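A rough sketch of the long vowel mark rule in the last item follows. It works at the character level and treats any preceding hiragana or katakana as "ending in a vowel", which is a simplification (it also accepts ン and ッ); the project's actual check may operate on phonemes instead. `fix_stray_long_marks` is a hypothetical name.

```python
import re

# Hiragana U+3041..U+3096 and katakana U+30A1..U+30FA (assumed ranges).
_KANA = "\u3041-\u3096\u30A1-\u30FA"
# A run of long vowel marks (U+30FC) that does not follow kana or another mark.
_STRAY_RUN = re.compile(f"(?<![{_KANA}\u30FC])\u30FC+")


def fix_stray_long_marks(text: str) -> str:
    # Replace each stray mark with a quotation dash (U+2015), keeping run length.
    return _STRAY_RUN.sub(lambda m: "\u2015" * len(m.group()), text)


print(fix_stray_long_marks("ーーそれは面白い"))
print(fix_stray_long_marks("ラーメン"))
```

The lookbehind includes the long vowel mark itself so that a run following kana, as in an elongated word, is left untouched as a whole.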
v2.0.1 (2024-02-05)
Minor bug fixes and improvements
- Style vectors that contain NaN (mainly caused by extremely short audio files) are now excluded from the training list.
- Added merging to the Colab notebook.
- Fixed the broken progress bar display during training.
- Updated the default jvnv models to the JP-Extra versions. To use the new models, download them manually from here, run python initialize.py, or place this bat file in the folder that contains the Style-Bert-VITS2 folder (where the install bat files were) and double-click it.
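The NaN exclusion in the first item amounts to a filter over the per-file style vectors. The sketch below assumes the vectors are available as a mapping from file path to a list of floats, which is a hypothetical layout; the real code presumably works on saved arrays.

```python
import math
from typing import Dict, List, Tuple


def split_by_nan(
    style_vectors: Dict[str, List[float]]
) -> Tuple[List[str], List[str]]:
    """Return (usable paths, excluded paths) based on NaN presence."""
    usable: List[str] = []
    excluded: List[str] = []
    for path, vector in style_vectors.items():
        if any(math.isnan(value) for value in vector):
            excluded.append(path)  # typically an extremely short clip
        else:
            usable.append(path)
    return usable, excluded


usable, excluded = split_by_nan(
    {"a.wav": [0.1, 0.2], "b.wav": [float("nan"), 0.3]}
)
print(usable, excluded)
```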
v2.0 (2024-02-03)
Major changes
The model architecture can now incorporate JP-Extra, the Japanese-specialized model of Bert-VITS2, and the pretrained model was adapted from the Bert-VITS2 JP-Extra one so that it works with Style-Bert-VITS2. (Thanks to @Stardust-minus for revising the model architecture and training it on Japanese.)
- This tends to improve Japanese pronunciation, accent, intonation, and naturalness.
- Style control through style vectors works as before.
- JP-Extra cannot (currently) synthesize English or Chinese speech.
- Old models remain usable, and you can still train with the old architecture.
- The default JVNV models are still the old version for now.
Improvements
- Merge.bat can now merge the voice in finer detail, separately by voice quality and voice pitch.
Bug fixes
- Fixed a bug that depended on the PyTorch version (torch is pinned to 2.1.2).
- Fixed an error in training and speech synthesis when ― (a dash, not the long vowel mark) appears twice in a row.
- Fixed a problem where the kana accent notation for "ん + vowel" words such as 「三円」 became 「サネン」 and occasionally caused errors (the phoneme for 「ん」 is now consistently N internally).
v1.3 (2024-01-09)
Major changes
- Fixed and refactored bugs in the Japanese pronunciation and accent processing inherited from the original Bert-VITS2:
  - 車両 was pronounced and trained as シャリヨオ, 思う as オモオ, and 見つける as ミッケル, and all accent information after such a word was lost.
  - The accent of 私はそれを見る was ワ↗タシ↘ワ ソ↗レ↘オ ミ↘ル; it is now ワ↗タシワ ソ↗レオ ミ↘ル.
  - Letters of the alphabet and Greek letters were ignored during training and synthesis; they are now kept. They are basically read letter by letter, though simple words seem to be read as words. For training, converting them to katakana or similar is still the safer choice.
  - As a side effect of the fix, preprocessing now stops on unreadable kanji and similar characters that were previously ignored. When that happens, check and correct the transcription.
- Speech can now be synthesized with adjusted accents (control is not complete, but results sometimes improve).
Existing models continue to work as before, and their accent and pronunciation may improve. Retraining with the new version may improve them further, but whether the gain is dramatic is unknown.
Improvements
- Audio slicing and transcription in Dataset.bat are more customizable (slice length settings, Whisper model selection for transcription, language selection, and so on).
- Style division in Style.bat can now play a specified number of sample audio files per style. Also added a new dimensionality reduction method (UMAP) and a new style division method (DBSCAN); UMAP may separate styles better.
- Speech synthesis in App.bat can now select the speaker for multi-speaker models.
- The Colab notebook has a new section for creating a dataset from audio files alone.
- Path settings are now collected in configs/paths.yml so that paths can be specified there when running in the cloud and similar environments (the Colab notebook was updated accordingly). The defaults are dataset_root: Data and assets_root: model_assets; change them if you run in the cloud.
- Added a script that uses SpeechMOS as one indicator of which step's output is best:
python speech_mos.py -m <model_name>
It displays a naturalness score for each step and saves the results to mos_{model_name}.csv and mos_{model_name}.png in the mos_results folder. To change the text that is read aloud, edit the file yourself. The score ignores accent, emotional expression, and intonation entirely and is only a rough guide, so listening to actual output and choosing by ear is still best.
- The warmup option for training now works (PR by @kale4eat. Thank you!). Change warmup_epochs under train in the config.json generated during preprocessing to set the number of warmup epochs. The default is 0, which gives the same learning-rate behavior as before.
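Editing warmup_epochs can be done by hand in any text editor; for scripted setups, a minimal sketch with the standard json module is shown below. It operates on a JSON string rather than a file so it runs anywhere, and the example config contains only the key named in this changelog; a real config.json has many more fields. `set_warmup_epochs` is a hypothetical helper.

```python
import json


def set_warmup_epochs(config_text: str, epochs: int) -> str:
    # Parse, update train.warmup_epochs, and serialize again.
    config = json.loads(config_text)
    config.setdefault("train", {})["warmup_epochs"] = epochs
    return json.dumps(config, indent=2, ensure_ascii=False)


# Trimmed-down stand-in for a generated config.json.
example_config = '{"train": {"warmup_epochs": 0}}'
updated = set_warmup_epochs(example_config, 3)
print(updated)
```

To apply it to a real file, read the file's text, pass it through the helper, and write the result back, keeping a backup of the original.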
Other
- Removed the normalization feature from audio slicing in Dataset.bat (it can be done in training preprocessing).
- Volume normalization and silence trimming in Train.bat are now off by default.
- Training progress is now shown against the total number of epochs, making overall progress easier to see (PR by @RedRayz. Thank you!).
- Other bug fixes (thank you, @tinjyuu and @darai0512!).
- Added a freeze_style option to config.json that leaves the style embedding untrained (default: false).
TIPS
- For Japanese training, setting freeze_bert and freeze_en_bert to true in config.json might keep English and Chinese speaking ability from degrading during training. This has not been compared carefully, so it is uncertain.
v1.2 (2023-12-31)
- Speech synthesis is supported for users without a GPU; install with Install-Style-Bert-VITS2-CPU.bat.
- Training on Google Colab is supported; a notebook was added.
- Added an API server for speech synthesis, started with python server_fastapi.py. Check the API specification at /docs after startup (PR by @darai0512. Thank you!).
- Training now generates a default style, Neutral, automatically. If you do not need to specify styles, you can try speech synthesis right after training. You can still create styles yourself as before.
- New merge feature: Merge.bat, webui_merge.py.
- Added an option to remove silence at the start and end of audio files during resampling in preprocessing (on by default).
- Renamed "style text" to "assist text", because it was easily confused with style specification.
- Other code refactoring.
v1.1 (2023-12-29)
- Improved and adjusted the Train and Dataset WebUIs (a batch preprocessing button and so on).
- Added an option to normalize volume during resampling in preprocessing (on by default).
v1.0 (2023-12-27)
- Initial release