Extract SSL Units with BPE
This repo contains code to extract HuBERT units and apply byte-pair encoding (BPE) to them.
The self-supervised (SSL) features are extracted from the 21st layer of TencentGameMate/chinese-hubert-large.
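The layer selection can be sketched with the Hugging Face transformers API. This is a minimal illustration, not the code in ssl_unit_encoder.py. The function names (`load_hubert`, `layer_features`, `extract_from_file`) are hypothetical. It assumes "the 21st layer" means the output of the 21st transformer block, that the audio is mono at 16 kHz, and that `soundfile` is used for loading.

```python
def load_hubert(name="TencentGameMate/chinese-hubert-large"):
    """Load the feature extractor and model (downloads weights on first use)."""
    # Imported lazily so layer_features() can be used without transformers installed.
    from transformers import HubertModel, Wav2Vec2FeatureExtractor

    extractor = Wav2Vec2FeatureExtractor.from_pretrained(name)
    model = HubertModel.from_pretrained(name).eval()
    return extractor, model


def layer_features(model, input_values, layer=21):
    """Return the hidden states of one layer, shape (batch, frames, dim)."""
    outputs = model(input_values, output_hidden_states=True)
    # hidden_states[0] is the input to the first transformer block and
    # hidden_states[k] is the output of the k-th block, so index 21 is
    # the 21st block under the assumption stated above.
    return outputs.hidden_states[layer]


def extract_from_file(path, layer=21):
    """Hypothetical end-to-end helper: wav file in, layer features out."""
    import soundfile as sf
    import torch

    extractor, model = load_hubert()
    wav, sr = sf.read(path)  # assumed: mono audio sampled at 16 kHz
    # Passing sampling_rate lets the extractor reject audio at the wrong rate.
    inputs = extractor(wav, sampling_rate=sr, return_tensors="pt")
    with torch.no_grad():
        return layer_features(model, inputs.input_values, layer)
```

Calling `extract_from_file("test.wav")` would return one feature vector per frame. How the repo turns those vectors into discrete units is not stated here, so that step is left out of the sketch.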
The number of BPE tokens is set to 6000.
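To make the BPE step concrete, here is a toy pure-Python sketch of byte-pair encoding over sequences of integer unit IDs. It is not necessarily the tokenizer this repo uses, and the names `learn_bpe`, `merge_pair`, and `encode` are hypothetical. On a real corpus, a 6000-token vocabulary would need far more merges than this tiny example.

```python
from collections import Counter


def merge_pair(seq, pair):
    """Replace every adjacent occurrence of `pair` in `seq` with one merged token."""
    out, i = [], 0
    while i < len(seq):
        if i + 1 < len(seq) and (seq[i], seq[i + 1]) == pair:
            out.append(seq[i] + seq[i + 1])  # concatenate the two unit tuples
            i += 2
        else:
            out.append(seq[i])
            i += 1
    return out


def learn_bpe(sequences, num_merges):
    """Learn up to `num_merges` merges from lists of unit IDs, most frequent pair first."""
    # Each token is a tuple of units, so merged tokens remember what they contain.
    seqs = [[(u,) for u in seq] for seq in sequences]
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for seq in seqs:
            pairs.update(zip(seq, seq[1:]))
        if not pairs:
            break
        best, count = pairs.most_common(1)[0]
        if count < 2:  # nothing repeats any more, so stop early
            break
        merges.append(best)
        seqs = [merge_pair(seq, best) for seq in seqs]
    return merges


def encode(seq, merges):
    """Apply the learned merges, in order, to a new sequence of unit IDs."""
    tokens = [(u,) for u in seq]
    for pair in merges:
        tokens = merge_pair(tokens, pair)
    return tokens
```

For example, `learn_bpe([[5, 7, 5, 7, 9], [5, 7, 9]], num_merges=10)` first merges units 5 and 7, then merges the result with 9, so `encode([5, 7, 9, 5], merges)` yields the two tokens `(5, 7, 9)` and `(5,)`.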
Usage
python ssl_unit_encoder.py --audio test.wav