Razer112 committed on
Commit
41ad9f8
1 Parent(s): 660094d

Update README.md

  1. README.md +43 -1
README.md CHANGED
@@ -9,4 +9,46 @@ tags:
9
  - voice-conversion
10
  - Voice2Voice
11
  ---
12
- ---
12
+
13
+ # DMR: Deep and Soft Voice Improvement Pretrain for RVC
14
+
15
+ ## Model description
16
+
17
+ DMR is a pretrain for RVC designed to improve voice conversion quality for soft and deep voices.
18
+
19
+ ## Intended uses
20
+
21
+ - Improve voice conversion quality, especially for soft and deep voices
22
+ - Enhance breathing sounds (and eventually whispering) in voice conversion
23
+ - Make better E-girl and Mommy voices :3
24
+
25
+ ## Training data
26
+
27
+ The model was trained on a custom dataset with the following details:
28
+ - Total duration: 11.3 hours
29
+ - Language: English
30
+ - Number of speakers: 22
31
+   - 16 female speakers
32
+   - 6 male speakers
33
+
34
+ ## Training process
35
+
36
+ DMR was trained with Applio on an RTX 4060 Ti 16 GB.
37
+ - Batch size: 8
38
+ - Pitch extraction method: Mangio-Crepe
39
+ - Hop length: 32
40
+ - Sample rate: 32 kHz
41
+
42
+ ## Usage
43
+
44
+ To use the DMR pretrain:
45
+
46
+ 1. Download both the D and G files of the DMR model.
47
+ 2. For standard RVC setup:
48
+    - Place the downloaded files in the `pretrained_v2` folder.
49
+ 3. For Applio users:
50
+    - Place the downloaded files in the `custom pretrains` folder.
51
+
52
+ ## Additional information
53
+
54
+ I plan to make a V2 of DMR trained on around 30 hours of speech, using either BigVGAN v2 or EVA-GAN, but there is no release date yet.
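
The file placement described in the Usage section can be scripted. Below is a minimal sketch, not part of the commit, assuming the two downloaded files are named `D_DMR.pth` and `G_DMR.pth` (hypothetical names; use whatever the actual D and G files are called). The helper `install_pretrain` is also hypothetical, and where the target folder lives depends on your RVC or Applio installation.

```python
from pathlib import Path
import shutil


def install_pretrain(download_dir, target_dir,
                     d_name="D_DMR.pth", g_name="G_DMR.pth"):
    """Copy the D and G pretrain files into target_dir.

    d_name and g_name are placeholder file names (an assumption);
    pass the real names of the downloaded files.
    Returns the list of destination paths.
    """
    download_dir = Path(download_dir)
    target_dir = Path(target_dir)

    # Both the discriminator (D) and generator (G) files are required.
    sources = [download_dir / d_name, download_dir / g_name]
    missing = [p.name for p in sources if not p.is_file()]
    if missing:
        raise FileNotFoundError(
            f"Missing pretrain file(s): {', '.join(missing)}"
        )

    # Create the target folder if it does not exist yet, then copy.
    target_dir.mkdir(parents=True, exist_ok=True)
    return [Path(shutil.copy2(src, target_dir / src.name)) for src in sources]
```

For a standard RVC setup this might be called as `install_pretrain("downloads", "pretrained_v2")`. For Applio, the target would be the `custom pretrains` folder instead. Both paths are relative here only for illustration.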