Commit a46e4ec by ArkanDash (1 parent: b418300)

feat: added 2 model

README.md CHANGED
@@ -8,64 +8,58 @@ pipeline_tag: audio-to-audio
  tags:
  - rvc
  ---
- # <center> RVC Genshin Impact Japanese Voice Model<br />
+ # <center> RVC Genshin Impact Japanese Voice Model
  ![model-cover.png](https://huggingface.co/ArkanDash/rvc-genshin-impact/resolve/main/model-cover.png)

- # About Retrieval based Voice Conversion (RVC)
- Learn more about Retrieval based Voice Conversion in this link below:<br />
+ ## About Retrieval based Voice Conversion (RVC)
+ Learn more about Retrieval based Voice Conversion in this link below:
  [RVC WebUI](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)

- # How to use?
+ ## How to use?
  Download the prezipped model and put to your RVC Project.

  Model test: [Google Colab](https://colab.research.google.com/drive/110kiMZTdP6Ri1lY9-NbQf17GVPPhHyeT?usp=sharing) / [RVC Models New](https://huggingface.co/spaces/ArkanDash/rvc-models-new) (Which is basically the same but hosted on spaces)

-
- ## <center> INFO <br />
- Model Created by ArkanDash <br />
+ ## <center> INFO
+ Model Created by ArkanDash
  The voice that was used in this model belongs to Hoyoverse.
+ The voice I make to make this model was ripped from the game (3.6 - 4.0).
+ #### Total Models: 45 Models
+ V1 Models: 19
+ V2 Models: 26

- The voice I make to make this model was ripped from the game (3.7). <br />
- Total Models: 43 Models
-
- V1 Models: 19 <br />
- V2 Models: 24 <br />
-
- Duplicate: <br />
+ Duplicate:
  - Zhongli (v1 & v2)
  - Nahida (v1 & v2)

- Plans: <br />
- - Nothing now.
+ Plans:
+ - Character from fontaine.
+ - v2 model recreation from v1 model

- Replace: <br />
- - Raiden Shogun model is now replaced with newer dataset due to bad voice from older model, The old model is now deleted.
+ Note:
+ - For faruzan, somehow the index file is smaller, Might retrain faruzan.
+ Error message: `Converged (lack of improvement in inertia) at step 1152/48215` <br />
+ - Furina has only 20 minutes of dataset. (Will update the model in the future when its 1 hour long)
+ - New model will be created using v2 training, I'm no longer making v1 model.

  Have a request?
  I accept genshin character request if you want it.
- For Honkai Star Rail coming soon...
+ Other request outside playable character:<br />
+ - Greater Lord Rukkhadevata: 750 Epochs, 16 Batch size, 48k Sample rate. (10 minutes dataset)
+ - Charlotte: 400 Epochs, 16 Batch size, 48k Sample rate. (18 minutes dataset)

- ### V1 Model <br />
- This was trained on Original RVC.<br />
- Pitch Extract using Harvest.<br />
- This model was trained with 100 epochs, 10 batch sizes, and a 40K sample rate (some models had a 48k sample rate).<br />
- Every V1 model was trained more or less around 30 minutes of character voice.
+ ## <center> Model Training Information

- I may exclude some models to higher epochs due to the low duration of the character's voice.<br />
- - Klee 150 Epochs
- - Fischl 150 Epochs
+ ### V1 Model Training <br />
+ ##### This was trained on Original RVC.
+ Pitch Extract using Harvest.
+ This model was trained with 100 epochs, 10 batch sizes, and a 40K sample rate (some models had a 48k sample rate).

- ### (New) V2 Model <br />
- This was trained on Mangio-Fork RVC.<br />
- Pitch Extract using Crepe.<br />
- This model was trained with 100 epochs, 8 batch sizes, and a 48K sample rate. (some models had a 40k sample rate).<br />
- Every V2 model was trained more or less around 60 minutes of character voice.
+ Every V1 model was trained more or less around 30 minutes of character voice.

- Other request:<br />
- - Greater Lord Rukkhadevata: 750 Epochs, 16 Batch size, 48k Sample rate. (10 minutes dataset)
- - Charlotte: 400 Epochs, 16 Batch size, 48k Sample rate. (18 minutes dataset)
+ ### V2 Model Training <br />
+ ##### This was trained on Mangio-Fork RVC.
+ Pitch Extract using Crepe.
+ This model was trained with 100 epochs, 8 batch sizes, and a 48K sample rate. (some models had a 40k sample rate).

- Note:
- - For faruzan, somehow the index file is smaller, But it output a log when training here: <br />
- `Converged (lack of improvement in inertia) at step 1152/48215` <br />
- I might retrain faruzan soon.
+ Every V2 model was trained more or less around 60 minutes of character voice.

prezipped/v2/diona-jp 105 epochs 48k v2.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:42fc1ca0b526c8e08065f71fab4342b17fb5d72664cd1d30a350e174c22f518e
+ size 368400627
prezipped/v2/furina-jp 275 epochs 48k v2.zip ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:478c50f20df781bab3244c16f82c26d4a1d17906e360a8afa01ae19b12914812
+ size 153089927
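
The two added `.zip` entries are Git LFS pointer files, so the diff records only each archive's SHA-256 `oid` and byte `size`, not the archive itself. As a minimal sketch that is not part of this commit, the Python below parses a pointer and checks a locally downloaded archive against it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is copied from the diona entry above.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        # Tolerate the leading "+" marker if the lines are pasted from a diff.
        line = line.lstrip("+").strip()
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    # Cheap check first: a size mismatch means the download is incomplete or wrong.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in 1 MiB chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer text for "prezipped/v2/diona-jp 105 epochs 48k v2.zip", copied from the diff.
DIONA_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:42fc1ca0b526c8e08065f71fab4342b17fb5d72664cd1d30a350e174c22f518e
size 368400627
"""

pointer = parse_lfs_pointer(DIONA_POINTER)
print(f"{pointer['algo']} archive, {pointer['size'] / 1_000_000:.1f} MB")
```

After downloading the archive, calling `matches_pointer("diona-jp 105 epochs 48k v2.zip", pointer)` (the local path is an assumption) should return `True` for an intact copy. The size is compared before hashing because it is cheap and catches truncated downloads early.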