Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
fdaudens 
posted an update Nov 27
Post
1014
The rapid progress in small audio models is mind-blowing! 🤯 Just tested OuteTTS v0.2 - cloned my voice from a 10s clip with impressive accuracy and natural prosody.

At 500M parameters, it's efficient enough to run on basic hardware but powerful enough for professional use.

This could transform how we produce audio content for new - think instant translated interviews keeping original voices, or scaled audio article production!

Demo and Model on the Hub: OuteAI/OuteTTS-0.2-500M h/t @reach-vb

What tool are you using to generate that video?

No voice cloning yet, but an 80M model I trained makes this:

If the voice sounds familiar, it is, and the classifier seems to agree.

Screenshot 2024-11-27 at 6.20.17 PM.png

·

I used Descript for the video. How about you?