How to make semantic data from wav files?

#1
by DeveloperEdy - opened

Hi, I am Edy who want to make Korean tokenizer for bark voice cloning.
I am wondering how to make semantic data from wav source files in Japanese.
I appreciate any helps.

You need to follow these steps:

  1. Generate semantic data from text(Korean content)
  2. Generate wav files from above semantic data
  3. Train Korean model from wavs and semantic data. Wav files is input of this model, semantic data is the output.

Basically, this model is used to predict semantic data from wav file. For more details, you can check my train script .

Thank you for the instruction. Just curiosity at step 1. creating data -- I saw it runs infinitely.

Do you have any recommendation how many semantic files should we produce? is there any magic number?

Sign up or log in to comment