---
datasets:
- mesolitica/Malaysian-Emilia
language:
- ms
- en
base_model:
- charactr/vocos-mel-24khz
---
# Malaysian Vocos
Continued pretraining of [charactr/vocos-mel-24khz](https://huggingface.co/charactr/vocos-mel-24khz) on [Malaysian Emilia](https://huggingface.co/datasets/mesolitica/Malaysian-Emilia), to give crisper audio reconstruction in the Malaysian context.
## Installation
To use Vocos only in inference mode, install it using:
```bash
pip install vocos
```
## Usage
### Reconstruct audio from mel-spectrogram
```python
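import torch
from vocos import Vocos

# Load the pretrained vocoder from the Hugging Face Hub
vocos = Vocos.from_pretrained("mesolitica/malaysian-vocos-mel-24khz")

# Random placeholder mel-spectrogram with shape (batch, mel channels, frames)
mel = torch.randn(1, 100, 256)
audio = vocos.decode(mel)  # waveform tensor of shape (batch, samples)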
```
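The `T` dimension of the mel-spectrogram is a number of frames, not a number of samples. The sketch below shows roughly how frame counts relate to audio duration. It assumes a 24 kHz sample rate (implied by the model name) and a hop length of 256 samples, as in the upstream `charactr/vocos-mel-24khz` setup. Both constants and both helper names are illustrative and not part of the `vocos` package, so check them against this checkpoint's config before relying on them.

```python
# Assumed values, based on the upstream charactr/vocos-mel-24khz setup.
# They are not read from this checkpoint; verify them in its config.
SAMPLE_RATE = 24_000  # Hz, implied by the "24khz" model name
HOP_LENGTH = 256      # samples between consecutive mel frames (assumption)


def frames_to_seconds(num_frames: int,
                      hop_length: int = HOP_LENGTH,
                      sample_rate: int = SAMPLE_RATE) -> float:
    """Approximate duration of the audio covered by `num_frames` mel frames."""
    return num_frames * hop_length / sample_rate


def seconds_to_frames(seconds: float,
                      hop_length: int = HOP_LENGTH,
                      sample_rate: int = SAMPLE_RATE) -> int:
    """Approximate number of mel frames needed for `seconds` of audio."""
    return round(seconds * sample_rate / hop_length)


# The 256-frame placeholder input above corresponds to about 2.73 s of audio.
print(f"{frames_to_seconds(256):.2f} s")
```

Under these assumptions, one second of audio corresponds to roughly 94 mel frames. The exact number of decoded samples may differ by a frame's worth, depending on how the model pads its inverse STFT.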