Since new TTS (Text-to-Speech) systems are coming out what feels like every day, and it's currently hard to compare them, my latest project has focused on doing just that.

I was inspired by the TTS-AGI/TTS-Arena (definitely check it out if you haven't), which compares recent TTS system using crowdsourced A/B testing.

I wanted to see if we can also do a similar evaluation with objective metrics and it's now available here:
ttsds/benchmark
Anyone can submit a new TTS model, and I hope this can provide a way to get some information on which areas models perform well or poorly in.

The paper with all the details is available here: https://arxiv.org/abs/2407.12707

liked a Space 5 months ago

Sleeping

⚡

Krushna Ganpat Bhosle

AI & ML interests

Recent Activity

Organizations

krushnabhosle's activity

Nerfies: Deformable Neural Radiance Fields

Reverse Image Video Search

TinyChart 3B

Qwen2.5-Coder-7B-Instruct

CLIP GamePhysics

Youtube Download

Canibal SEO

Ultrapixel-demo

HivisionIDPhotos

Nutrition Table Content Analysis

Food Indentification And Nutrition Info

Midi Writer

Poetry Writer

Style Transfer

Text To Gif

Which Fashion Collection ?

Water Body Segmentation

— Zero GPU Spaces —

Yukthi