blaise-tk committed on
Commit
97a3660
•
1 Parent(s): a53bc97

Update README.md

Files changed (1)
  1. README.md +80 -46
README.md CHANGED
@@ -1,31 +1,62 @@
1
- # Applio
2
-
3
- Welcome to **Applio**, the ultimate voice cloning tool meticulously optimized for unrivaled power, modularity, and a user-friendly experience.
4
-
5
- [![🍏 Applio Plugins](https://img.shields.io/badge/%F0%9F%8D%8F%20Applio%20Plugins-ffffff?style=flat-square&link=https://github.com/IAHispano/Applio-Plugins)](https://github.com/IAHispano/Applio-Plugins)
6
- [![Precompiled Versions](https://img.shields.io/badge/Precompiled%20Versions-ffffff?style=flat-square&logo=data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAA4AAAAOCAYAAAAfSC3RAAAAAXNSR0IArs4c6QAAAARnQU1BAACxjwv8YQUAAAAJcEhZcwAALEsAACxLAaU9lqkAAAHzSURBVDhPrVFNaBNBFP5m9idJU9tE2srS5KSNlxT00AhSEEH01GMOIkpPBSmCehE99aIgVOrJgzdP6kE8qoQKEqmIV00PhRQ3jSbdTbRVm7rbnfHNJLV4lX7w3sx8733z3szDvkIIkQyCoEB2otls9vfof8B6q0a9Xu8b2V66bTSez7BguU+FhT3eQfb8Q549d4sxttVL3RNKKS3xaeEVX797GkOxHkuQZF4Ikb3zZjmcOJvP5wNFc+UUwrXFWd6Y1yK3YWlOadx12o9Y4O7cqZyDKzpA2BUy7j6bRprBWwsw/yTdLfTNwL3HaeBnB0gF4KtPp1WuEuwJ2cYYLAPDjokbF9rUOpA+IHDzYhtIUOsxCyxqjalcJdDCYrHIpLA7aPwAOiEW38dx+XoGM9cy+Lhiak7HWKKjcpVGu99+5ZL9buoR7BY9NkJE98nPo5AxCXP0C1USgGkAO8MIT76ctQ8efqAr8u+VY0ATMOg4EIcxYMMc92DlfLCkrTmYFBNfgY3Kca1RDiZdLah4PyWlEsAg2eZ219RecSomGZgRV//WbbVWq2UObb29b7RfT/FwxQZPIUoWlqQUkfHrwySTm0zauXBn6MyLVnLyquM4q0q3C14qlQY9z5uoVqtH6UyPovG57hHf9wvlcpnm8ncK/wvgD6Orstc1XrkKAAAAAElFTkSuQmCC&link=https://huggingface.co/IAHispano/applio/tree/main/Applio%20V3%20Precompiled)](https://huggingface.co/IAHispano/applio/tree/main/Applio%20V3%20Precompiled)
7
- ![GitHub Release](https://img.shields.io/github/v/release/iahispano/applio-rvc-fork?style=flat-square)
8
- ![GitHub Repo stars](https://img.shields.io/github/stars/iahispano/applio-rvc-fork?style=flat-square)
9
- ![GitHub forks](https://img.shields.io/github/forks/iahispano/applio-rvc-fork?style=flat-square)
10
- [![Support Discord](https://img.shields.io/discord/1096877223765606521?style=flat-square)](https://discord.gg/iahispano)
11
- [![Issues](https://img.shields.io/github/issues/iahispano/applio-rvc-fork?style=flat-square)](https://github.com/IAHispano/Applio-RVC-Fork/issues)
12
- [![Open In Collab](https://img.shields.io/badge/google_colab-F9AB00?style=flat-square&logo=googlecolab&logoColor=white)](https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb)
13
 
14
  ## Content Table
15
  - [**Installation**](#installation)
16
  - [Windows](#windows)
17
  - [Linux](#linux)
18
- - [Using Makefile](#using-makefile-for-platforms-such-as-paperspace)
19
  - [**Usage**](#usage)
20
  - [Windows](#windows-1)
21
  - [Linux](#linux-1)
22
- - [Using Makefile](#using-makefile-for-platforms-such-as-paperspace-1)
23
  - [**Repository Enhancements**](#repository-enhancements)
24
- - [**Credits**](#credits)
25
  - [Contributors](#contributors)
26
 
27
  ## Installation
28
- Download the latest version from [GitHub Releases](https://github.com/IAHispano/Applio-RVC-Fork/releases) or use [Precompiled Versions](https://huggingface.co/IAHispano/applio/tree/main/Applio%20V3%20Precompiled).
29
 
30
  ### Windows
31
  ```bash
@@ -33,12 +64,14 @@ Download the latest version from [GitHub Releases](https://github.com/IAHispano/
33
  ```
34
 
35
  ### Linux
 
36
  ```bash
37
  chmod +x run-install.sh
38
  ./run-install.sh
39
  ```
40
 
41
- ### Using Makefile (for platforms such as [Paperspace](https://www.paperspace.com/))
 
42
  ```
43
  make run-install
44
  ```
@@ -57,21 +90,40 @@ chmod +x run-applio.sh
57
  ./run-applio.sh
58
  ```
59
 
60
- ### Using Makefile (for platforms such as [Paperspace](https://www.paperspace.com/))
 
61
  ```
62
  make run-applio
63
  ```
64
 
65
  ## Repository Enhancements
66
 
67
- This repository has undergone significant improvements to enhance its functionality and maintainability:
68
-
69
- - **Code Modularization:** The codebase has been restructured to follow a modular approach. This ensures better organization, readability, and ease of maintenance.
70
- - **Hop Length Implementation:** Special thanks to [@Mangio621](https://github.com/Mangio621/Mangio-RVC-Fork) for introducing hop length implementation. This enhancement enhances the efficiency and performance on Crepe (previously known as Mangio-Crepe).
71
- - **Translations to +30 Languages:** The repository now supports translations in over 30 languages, making it more accessible to a global audience.
72
- - **Cross-Platform Compatibility:** With multiplatform compatibility, this repository can seamlessly operate across various platforms, providing a consistent experience to users.
73
- - **Optimized Requirements:** The project's requirements have been fine-tuned for improved performance and resource utilization.
74
- - **Simple Installation:** The installation process has been streamlined, ensuring a straightforward and user-friendly experience for setup.
75
 
76
  These enhancements contribute to a more robust and scalable codebase, making the repository more accessible for contributors and users alike.
77
 
@@ -81,29 +133,11 @@ These enhancements contribute to a more robust and scalable codebase, making the
81
 
82
  We appreciate all contributions ❤️
83
 
84
- ## Planned Features
85
- - Implement: Support for Apple Devices ([Issue Link](https://github.com/pytorch/pytorch/issues/77764))
86
- - Implement: rmvpe_gpu
87
- - Implement: Overtraining detector
88
- - Implement: Autotune
89
- - Implement: Training stop
90
- - Fix: Model fusion
91
-
92
- ## Credits
93
- - [VITS](https://github.com/jaywalnut310/vits) by jaywalnut310
94
- - [RMVPE](https://github.com/Dream-High/RMVPE) by Dream-High
95
- - [ContentVec](https://github.com/auspicious3000/contentvec/) by auspicious3000
96
- - [HIFIGAN](https://github.com/jik876/hifi-gan) by jik876
97
- - [Gradio](https://github.com/gradio-app/gradio) by gradio-app
98
- - [FFmpeg](https://github.com/FFmpeg/FFmpeg) by FFmpeg
99
- - [audio-slicer](https://github.com/openvpi/audio-slicer) by openvpi
100
- - [Ilaria-Audio-Analyzer](https://github.com/TheStingerX/Ilaria-Audio-Analyzer) by TheStingerX
101
  - [gradio-screen-recorder](https://huggingface.co/spaces/gstaff/gradio-screen-recorder) by gstaff
102
  - [RVC_CLI](https://github.com/blaise-tk/RVC_CLI) by blaise-tk
103
- - [Retrieval-based-Voice-Conversion-WebUI](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI) by RVC-Project
104
- - [Mangio-RVC-Fork](https://github.com/Mangio621/Mangio-RVC-Fork) by Mangio621
105
 
106
  ### Contributors
107
  <a href="https://github.com/IAHispano/Applio/graphs/contributors" target="_blank">
108
  <img src="https://contrib.rocks/image?repo=IAHispano/Applio" />
109
- </a>
 
1
+ ---
2
+ pipeline_tag: audio-to-audio
3
+ tags:
4
+ - AI
5
+ - RVC
6
+ - VITS
7
+ - VC
8
+ - Voice Conversion
9
+ - Voice2Voice
10
+ ---
11
+ <h1 align="center">
12
+ <a href="https://applio.org" target="_blank"><img src="https://github.com/IAHispano/Applio/assets/133521603/a5cc5c72-ed68-48a5-954f-db9f1dc4e7de" alt="Applio"></a>
13
+ </h1>
14
+
15
+ <p align="center">
16
+ <img alt="Contributors" src="https://img.shields.io/github/contributors/iahispano/applio?style=for-the-badge&color=00AA68" />
17
+ <img alt="Release" src="https://img.shields.io/github/release/iahispano/applio?style=for-the-badge&color=00AA68" />
18
+ <img alt="Stars" src="https://img.shields.io/github/stars/iahispano/applio?style=for-the-badge&color=00AA68" />
19
+ <img alt="Fork" src="https://img.shields.io/github/forks/iahispano/applio?style=for-the-badge&color=00AA68" />
20
+ <img alt="Issues" src="https://img.shields.io/github/issues/iahispano/applio?style=for-the-badge&color=00AA68" />
21
+ </p>
22
+
23
+ <p align="center">VITS-based Voice Conversion focused on simplicity, quality and performance</p>
24
+
25
+ <p align="center">
26
+ <a href="https://applio.org" target="_blank">🌐 Website</a>
27
+ •
28
+ <a href="https://docs.applio.org" target="_blank">📚 Documentation</a>
29
+ •
30
+ <a href="https://discord.gg/iahispano" target="_blank">☎️ Discord</a>
31
+ </p>
32
+
33
+ <p align="center">
34
+ <a href="https://github.com/IAHispano/Applio-Plugins" target="_blank">🛒 Plugins</a>
35
+ •
36
+ <a href="https://huggingface.co/IAHispano/Applio/tree/main/Compiled" target="_blank">📦 Compiled</a>
37
+ •
38
+ <a href="https://applio.org/playground" target="_blank">🎮 Playground</a>
39
+ •
40
+ <a href="https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio.ipynb" target="_blank">🔎 Google Colab (UI)</a>
41
+ •
42
+ <a href="https://colab.research.google.com/github/iahispano/applio/blob/master/assets/Applio_NoUI.ipynb" target="_blank">🔎 Google Colab (No UI)</a>
43
+ </p>
44
 
45
  ## Content Table
46
  - [**Installation**](#installation)
47
  - [Windows](#windows)
48
  - [Linux](#linux)
49
+ - [Makefile](#makefile)
50
  - [**Usage**](#usage)
51
  - [Windows](#windows-1)
52
  - [Linux](#linux-1)
53
+ - [Makefile](#makefile-1)
54
  - [**Repository Enhancements**](#repository-enhancements)
55
+ - [**References**](#references)
56
  - [Contributors](#contributors)
57
 
58
  ## Installation
59
+ Download the latest version from [GitHub Releases](https://github.com/IAHispano/Applio-RVC-Fork/releases) or use the [Compiled Versions](https://huggingface.co/IAHispano/Applio/tree/main/Compiled).
60
 
61
  ### Windows
62
  ```bash
 
64
  ```
65
 
66
  ### Linux
67
+ Certain Linux-based operating systems may encounter complications with the installer. In such instances, we suggest installing the `requirements.txt` within a Python environment version 3.9 to 3.11.
68
  ```bash
69
  chmod +x run-install.sh
70
  ./run-install.sh
71
  ```
72
 
73
+ ### Makefile
74
+ For platforms such as [Paperspace](https://www.paperspace.com/)
75
  ```
76
  make run-install
77
  ```
 
90
  ./run-applio.sh
91
  ```
92
 
93
+ ### Makefile
94
+ For platforms such as [Paperspace](https://www.paperspace.com/)
95
  ```
96
  make run-applio
97
  ```
98
 
99
  ## Repository Enhancements
100
 
101
+ This repository has undergone significant enhancements to improve its functionality and maintainability:
102
+
103
+ - **Modular Codebase:** Restructured codebase following a modular approach for better organization, readability, and maintenance.
104
+ - **Hop Length Implementation:** Implemented hop length, courtesy of [@Mangio621](https://github.com/Mangio621/Mangio-RVC-Fork), boosting efficiency and performance, especially on Crepe (formerly Mangio-Crepe).
105
+ - **Translations in 30+ Languages:** Added support for translations in over 30 languages, enhancing accessibility for a global audience.
106
+ - **Cross-Platform Compatibility:** Ensured seamless operation across various platforms for a consistent user experience.
107
+ - **Optimized Requirements:** Fine-tuned project requirements for enhanced performance and resource efficiency.
108
+ - **Streamlined Installation:** Simplified installation process for a user-friendly setup experience.
109
+ - **Hybrid F0 Estimation:** Introduced a personalized 'hybrid' F0 estimation method utilizing nanmedian, combining F0 calculations from various methods to achieve optimal results.
110
+ - **Easy-to-Use UI:** Implemented a user-friendly interface for intuitive interaction.
111
+ - **Optimized Code & Dependencies:** Enhanced code and streamlined dependencies for improved efficiency.
112
+ - **Plugin System:** Introduced a plugin system for extending functionality and customization.
113
+ - **Overtraining Detector:** Implemented an overtraining detector which halts training once a specified epoch limit is reached, preventing excessive training.
114
+ - **Model Search:** Integrated a model search feature directly into the application interface, facilitating easy model discovery.
115
+ - **Enhancements in Pretrained Models:** Introduced additional functionalities such as custom pretrained models, allowing users to utilize their preferred pretrained models without requiring RVC1 pretrained models upon installation.
116
+ - **Voice Blender:** Developed a voice blender feature that combines two trained models to create a new one, offering versatility in model generation.
117
+ - **Accessibility Improvements:** Enhanced accessibility with descriptive tooltips indicating the function of each element in the user interface, making it more user-friendly for all users.
118
+ - **New F0 Extraction Methods:** Introduced new F0 extraction methods such as FCPE or Hybrid, expanding options for pitch extraction.
119
+ - **Output Format Selection:** Implemented an output format selection feature, allowing users to choose the format in which they want to save their audio files.
120
+ - **Hashing System:** Implemented a hashing system where each created model is assigned a unique ID to prevent unauthorized duplication or theft.
121
+ - **Model Download System:** Added support for downloading models from various websites such as Google Drive, Yandex, Pixeldrain, Discord, Hugging Face, or Applio.org, enhancing model accessibility.
122
+ - **TTS Enhancements:** Improved Text-to-Speech functionality with support for uploading TXT files, increasing flexibility in input methods.
123
+ - **Split Audio:** Implemented audio splitting functionality which divides audio into segments for inference, subsequently merging them to create the final audio, resulting in faster processing times and potentially better outcomes.
124
+ - **Discord Presence:** Displayed presence on Discord indicating active usage of Applio, with plans to incorporate different statuses based on activities within the application.
125
+ - **Flask Integration:** Integration with Flask, initially disabled by default, allows for automatic model downloads from the web by simply clicking the Applio button next to the model download button in the settings tab.
126
+ - **Support Tab:** Added a support tab enabling users to record their screen to demonstrate encountered issues, facilitating faster issue resolution by allowing users to create GitHub issues for review and troubleshooting.
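
The **Hybrid F0 Estimation** item in the list above describes combining F0 tracks from several methods with nanmedian. The sketch below illustrates that idea only and is not Applio's actual code. The function name `hybrid_f0`, the convention that 0 Hz marks an unvoiced frame, and the requirement that all tracks share the same frame count are assumptions made for the example.

```python
import warnings

import numpy as np


def hybrid_f0(estimates):
    """Combine per-frame F0 tracks (in Hz) with a NaN-aware median.

    `estimates` is a sequence of equal-length tracks, one per method.
    Hypothetical helper: the name and conventions are illustrative.
    """
    # Stack into shape (n_methods, n_frames).
    stacked = np.vstack([np.asarray(e, dtype=float) for e in estimates])
    # Assumption: values <= 0 mean "unvoiced" and should not vote.
    stacked[stacked <= 0] = np.nan
    with warnings.catch_warnings():
        # np.nanmedian warns when a frame has no valid estimate at all.
        warnings.simplefilter("ignore", category=RuntimeWarning)
        median = np.nanmedian(stacked, axis=0)
    # Frames where every method reported "unvoiced" fall back to 0 Hz.
    return np.nan_to_num(median, nan=0.0)


if __name__ == "__main__":
    track_a = [100.0, 0.0, 220.0, 0.0]
    track_b = [102.0, 150.0, 0.0, 0.0]
    track_c = [98.0, 152.0, 224.0, 0.0]
    print(hybrid_f0([track_a, track_b, track_c]))
```

Taking the median rather than the mean limits the effect of a single method producing an outlier, such as an octave error, on a given frame.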
127
 
128
  These enhancements contribute to a more robust and scalable codebase, making the repository more accessible for contributors and users alike.
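
The **Split Audio** item describes dividing audio into segments, running inference on each, and merging the results. A minimal sketch of that flow follows. It assumes fixed-length segments and a caller-supplied `convert` callable standing in for inference. Both are hypothetical, and the README does not say how Applio chooses its split points.

```python
import numpy as np


def split_convert_merge(audio, segment_len, convert):
    """Run `convert` on consecutive segments of `audio` and join the results.

    `audio` is a 1-D sample array, `segment_len` a sample count, and
    `convert` a placeholder for the per-segment inference step.
    """
    if segment_len <= 0:
        raise ValueError("segment_len must be positive")
    audio = np.asarray(audio)
    # Slice into consecutive chunks; the last one may be shorter.
    segments = [audio[i:i + segment_len] for i in range(0, len(audio), segment_len)]
    converted = [np.asarray(convert(seg)) for seg in segments]
    # Merge in the original order; empty input yields empty output.
    return np.concatenate(converted) if converted else audio[:0]


if __name__ == "__main__":
    samples = np.arange(10, dtype=float)
    # A stand-in "conversion" that halves the amplitude.
    print(split_convert_merge(samples, 4, lambda seg: seg * 0.5))
```

Plain concatenation can leave audible discontinuities at segment boundaries, so a practical version would more likely split at silences or crossfade neighbouring segments.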
129
 
 
133
 
134
  We appreciate all contributions ❤️
135
 
136
+ ## References
137
  - [gradio-screen-recorder](https://huggingface.co/spaces/gstaff/gradio-screen-recorder) by gstaff
138
  - [RVC_CLI](https://github.com/blaise-tk/RVC_CLI) by blaise-tk
 
 
139
 
140
  ### Contributors
141
  <a href="https://github.com/IAHispano/Applio/graphs/contributors" target="_blank">
142
  <img src="https://contrib.rocks/image?repo=IAHispano/Applio" />
143
+ </a>