---
license: openrail
pipeline_tag: audio-to-audio
tags:
- pretrained
- RVC
- ai
- voice-cloning
- voice-conversion
- Voice2Voice
---
# DMR: Deep and Soft Voice Improvement Pretrain for RVC
## Model description
DMR is a pretrain (pretrained base model) for RVC, designed to improve voice conversion for soft and deep voices.
## Intended uses
- Improve voice conversion quality, especially for soft and deep voices
- Enhance breathing sounds (and eventually whispering) in voice conversion
- Make better E-girl and Mommy voices :3
## Training data
The model was trained on a custom dataset with the following details:
- Total duration: 11.3 hours
- Language: English
- Number of speakers: 22
  - 16 female speakers
  - 6 male speakers
## Training Process
DMR was trained with Applio on an RTX 4060 Ti 16 GB with the following settings:
- Batch size: 8
- Pitch extraction method: Mangio-Crepe
- Hop length: 32
- Sample rate: 32 kHz (32k)
## Usage
To use the DMR pretrain:
1. Download both the D and G files of the DMR model.
2. For a standard RVC setup:
   - Place the downloaded files in the `pretrained_v2` folder.
3. For Applio users:
   - Place the downloaded files in the `custom pretrains` folder.
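The copy step above can also be scripted. The following is a minimal sketch, not part of RVC or Applio: `install_pretrain` is a hypothetical helper, and the file names `D_DMR.pth` and `G_DMR.pth` and the folder paths in the commented example are assumptions, so substitute the names of the files you actually downloaded and the folder location in your own install.

```python
import shutil
from pathlib import Path


def install_pretrain(download_dir, target_dir, filenames):
    """Copy the pretrain files into target_dir and return the new paths.

    Hypothetical helper for illustration; RVC and Applio do not ship it.
    """
    download_dir = Path(download_dir)
    target_dir = Path(target_dir)

    # Fail early if the D or the G file is missing, since both are required.
    missing = [name for name in filenames if not (download_dir / name).is_file()]
    if missing:
        raise FileNotFoundError(f"Missing pretrain files: {missing}")

    # Create the destination folder if it does not exist yet.
    target_dir.mkdir(parents=True, exist_ok=True)

    return [
        Path(shutil.copy2(download_dir / name, target_dir / name))
        for name in filenames
    ]


# Example call (file names and paths are assumptions, adjust to your setup):
# install_pretrain("downloads", "pretrained_v2", ["D_DMR.pth", "G_DMR.pth"])
```

For Applio, pass the path of your `custom pretrains` folder as `target_dir` instead.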
## Additional Information
I plan to make a V2 of DMR trained on around 30 hours of speech, using either BigVGAN v2 or EVA-GAN, but there is no release date yet.