Truncation : Audio OP limitation

by avneetsingh - opened 5 days ago

5 days ago

Hi, I am trying to use this model to generate audio clips of about five sentences in Bengali and Marathi, but the audio output ends abruptly. Is there a solution to this problem?

avneetsingh

5 days ago

From what I have read about Parler-TTS, the maximum output length is 30 seconds. Can we work around that limit?
https://github.com/huggingface/parler-tts/blob/main/training/README.md#3-training

avneetsingh changed discussion title from Truncation. to Truncation : Audio OP limitation 5 days ago

AshwinSankar

AI4Bharat org 3 days ago

Hi, Indic Parler-TTS can consistently generate sequences of up to 10-12 seconds. We are working on improving the model. For your particular use case, I suggest splitting the text into individual sentences and generating them as a batch instead, as that will give the best quality while staying consistent with the prompt you have described.
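
A minimal sketch of the split-and-batch approach suggested above, not an official recipe. Everything named here is an assumption made for illustration: `split_sentences`, `synthesize_long_text`, the sentence-boundary regex (which splits on the danda "।" as well as ".", "!" and "?"), and the short pause between clips. `synthesize_batch` is a hypothetical callable that you would write yourself around the model's batch generation; it takes a list of sentences and a list of descriptions and returns one list of audio samples per sentence.

```python
import re
from typing import Callable, List, Sequence

# Assumption: sentences end with a danda or common Latin punctuation,
# followed by whitespace. Adjust for your own text.
SENTENCE_END = re.compile(r"(?<=[।.!?])\s+")


def split_sentences(text: str) -> List[str]:
    """Split text into sentences, keeping the end punctuation attached."""
    return [s for s in SENTENCE_END.split(text.strip()) if s]


def synthesize_long_text(
    text: str,
    description: str,
    synthesize_batch: Callable[[Sequence[str], Sequence[str]], List[List[float]]],
    sampling_rate: int,
    pause_seconds: float = 0.2,
) -> List[float]:
    """Generate one short clip per sentence and join them with short pauses.

    `synthesize_batch` is a hypothetical wrapper around the TTS model.
    The same description is repeated for every sentence so that the
    voice stays consistent across clips.
    """
    sentences = split_sentences(text)
    if not sentences:
        return []

    clips = synthesize_batch(sentences, [description] * len(sentences))

    # Silence inserted between clips; the 0.2 s default is an arbitrary choice.
    silence = [0.0] * int(round(sampling_rate * pause_seconds))

    audio: List[float] = []
    for index, clip in enumerate(clips):
        if index > 0:
            audio.extend(silence)
        audio.extend(clip)
    return audio
```

Each sentence stays well inside the 10-12 second range mentioned above, so no single generation should be cut off. If one sentence is itself very long, it may need a further split, for example at commas.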
