jhj0517 commited on
Commit
2109221
·
1 Parent(s): d74c8ff

Add citation in README

Browse files
Files changed (2) hide show
  1. README.md +2 -3
  2. modules/uvr/music_separator.py +0 -1
README.md CHANGED
@@ -25,6 +25,7 @@ If you wish to try this on Colab, you can do it in [here](https://colab.research
25
  - Translate subtitle files using Facebook NLLB models
26
  - Translate subtitle files using DeepL API
27
  - Pre-processing audio input with [Silero VAD](https://github.com/snakers4/silero-vad).
 
28
  - Post-processing with speaker diarization using the [pyannote](https://huggingface.co/pyannote/speaker-diarization-3.1) model.
29
  - To download the pyannote model, you need to have a Huggingface token and manually accept their terms in the pages below.
30
  1. https://huggingface.co/pyannote/speaker-diarization-3.1
@@ -109,8 +110,6 @@ This is Whisper's original VRAM usage table for models.
109
  - [x] Integrate with faster-whisper
110
  - [x] Integrate with insanely-fast-whisper
111
  - [x] Integrate with whisperX ( Only speaker diarization part )
112
- - [ ] Add background music separation pre-processing with [MVSEP-MDX23](https://github.com/ZFTurbo/MVSEP-MDX23-music-separation-model)
113
  - [ ] Add fast api script
114
  - [ ] Support real-time transcription for microphone
115
-
116
-
 
25
  - Translate subtitle files using Facebook NLLB models
26
  - Translate subtitle files using DeepL API
27
  - Pre-processing audio input with [Silero VAD](https://github.com/snakers4/silero-vad).
28
+ - Pre-processing audio input to separate BGM with [UVR](https://github.com/Anjok07/ultimatevocalremovergui), [UVR-api](https://github.com/NextAudioGen/ultimatevocalremover_api).
29
  - Post-processing with speaker diarization using the [pyannote](https://huggingface.co/pyannote/speaker-diarization-3.1) model.
30
  - To download the pyannote model, you need to have a Huggingface token and manually accept their terms in the pages below.
31
  1. https://huggingface.co/pyannote/speaker-diarization-3.1
 
110
  - [x] Integrate with faster-whisper
111
  - [x] Integrate with insanely-fast-whisper
112
  - [x] Integrate with whisperX ( Only speaker diarization part )
113
+ - [x] Add background music separation pre-processing with [UVR](https://github.com/Anjok07/ultimatevocalremovergui)
114
  - [ ] Add fast api script
115
  - [ ] Support real-time transcription for microphone
 
 
modules/uvr/music_separator.py CHANGED
@@ -1,4 +1,3 @@
1
- # Credit to Team UVR : https://github.com/Anjok07/ultimatevocalremovergui
2
  from typing import Optional, Union
3
  import numpy as np
4
  import torchaudio
 
 
1
  from typing import Optional, Union
2
  import numpy as np
3
  import torchaudio