datasets:
  - mesolitica/Malaysian-Emilia
language:
  - ms
  - en
base_model:
  - charactr/vocos-mel-24khz

Malaysian Vocos

This model continues pretraining charactr/vocos-mel-24khz on Malaysian Emilia so that reconstructed audio sounds crisper on Malaysian speech.

Installation

To use Vocos only in inference mode, install it using:

pip install vocos

Usage

Reconstruct audio from mel-spectrogram

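import torch
from vocos import Vocos

# Download the checkpoint and config from the Hugging Face Hub
vocos = Vocos.from_pretrained("mesolitica/malaysian-vocos-mel-24khz")

# Random stand-in for a real mel-spectrogram: (batch, mel channels, frames)
mel = torch.randn(1, 100, 256)  # B, C, T
audio = vocos.decode(mel)  # waveform tensor of shape (B, samples)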
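
The `B, C, T` comment in the usage example is easy to misread, so here is a small sketch of how the frame count `T` relates to audio duration, plus a shape check to run before calling `decode`. The 24 kHz sample rate comes from the model name and the 100 mel channels from the example above. The hop length of 256 is an assumption based on the base model's usual configuration and is not stated in this README, so verify it against the checkpoint's config. The helper names are hypothetical and not part of the `vocos` package.

```python
import math

SAMPLE_RATE = 24_000  # from the model name (24khz)
N_MELS = 100          # channel dimension used in the usage example
HOP_LENGTH = 256      # ASSUMPTION: not stated in this README; check the checkpoint config


def approx_duration(num_frames, hop_length=HOP_LENGTH, sample_rate=SAMPLE_RATE):
    """Approximate seconds of audio covered by `num_frames` mel frames."""
    return num_frames * hop_length / sample_rate


def frames_for_duration(seconds, hop_length=HOP_LENGTH, sample_rate=SAMPLE_RATE):
    """Approximate number of mel frames needed to cover `seconds` of audio."""
    return math.ceil(seconds * sample_rate / hop_length)


def check_mel_shape(shape, n_mels=N_MELS):
    """Raise ValueError unless `shape` looks like (B, n_mels, T)."""
    if len(shape) != 3:
        raise ValueError(f"expected 3 dimensions (B, C, T), got {len(shape)}")
    if shape[1] != n_mels:
        raise ValueError(f"expected {n_mels} mel channels, got {shape[1]}")
    if shape[2] < 1:
        raise ValueError("expected at least one frame")
```

Under these assumptions, the `(1, 100, 256)` tensor from the example corresponds to roughly 2.7 seconds of audio. Calling `check_mel_shape(tuple(mel.shape))` before `vocos.decode(mel)` catches a transposed or wrongly sized input early.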