@fdaudens on Hugging Face: "The rapid progress in small audio models is mind-blowing! 🤯 Just tested…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

fdaudens

posted an update Nov 27

Post

1014

The rapid progress in small audio models is mind-blowing! 🤯 Just tested OuteTTS v0.2 - cloned my voice from a 10s clip with impressive accuracy and natural prosody.

At 500M parameters, it's efficient enough to run on basic hardware but powerful enough for professional use.

This could transform how we produce audio content for new - think instant translated interviews keeping original voices, or scaled audio article production!

Demo and Model on the Hub: OuteAI/OuteTTS-0.2-500M h/t @reach-vb

hexgrad

Nov 28

What tool are you using to generate that video?

No voice cloning yet, but an 80M model I trained makes this:

If the voice sounds familiar, it is, and the classifier seems to agree.

fdaudens

Nov 28

I used Descript for the video. How about you?

In this post

fdaudens Florent Daudens
hexgrad Hexgrad