File size: 1,096 Bytes
5adddd1
 
c451c5f
 
 
 
5adddd1
 
c451c5f
5adddd1
c451c5f
875688d
 
 
5adddd1
 
c451c5f
5adddd1
c451c5f
5adddd1
c451c5f
128f228
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
---
library_name: transformers
tags:
- DAC
- Audio
license: mit
---

# This repository offers 16khzDAC with 9 codebooks and 8kbps bitrate.

For integration into ParlerTTS, you first need to install the [Parler-TTS library](https://github.com/huggingface/parler-tts) with (to do once):
```sh
pip install git+https://github.com/huggingface/parler-tts.git
```


# Descript Audio Codec (.dac): High-Fidelity Audio Compression with Improved RVQGAN

This repository is a wrapper around the original **Descript Audio Codec** model, a high fidelity general neural audio codec, introduced in the paper titled **High-Fidelity Audio Compression with Improved RVQGAN**.

It is designed to be used as a drop-in replacement of the [transformers implementation](https://huggingface.co/docs/transformers/v4.39.3/en/model_doc/encodec#overview) of [Encodec](https://github.com/facebookresearch/encodec), so that architectures that use Encodec can also be trained with DAC instead.
The [Parler-TTS library](https://github.com/huggingface/parler-tts) is an example of how to use DAC to train high-quality TTS models.