Singh's picture
2 1

Singh

jslinuxta
ยท

AI & ML interests

None yet

Recent Activity

View all activity

Organizations

None yet

jslinuxta's activity

reacted to hexgrad's post with ๐Ÿ”ฅ 7 days ago
view post
Post
4944
Wanted: Peak Data. I'm collecting audio data to train another TTS model:
+ AVM data: ChatGPT Advanced Voice Mode audio & text from source
+ Professional audio: Permissive (CC0, Apache, MIT, CC-BY)

This audio should *impress* most native speakers, not just barely pass their audio Turing tests. Professional-caliber means S or A-tier, not your average bloke off the street. Traditional TTS may not make the cut. Absolutely no low-fi microphone recordings like Common Voice.

The bar is much higher than last time, so there are no timelines yet and I expect it may take longer to collect such mythical data. Raising the bar means evicting quite a bit of old data, and voice/language availability may decrease. The theme is *quality* over quantity. I would rather have 1 hour of A/S-tier than 100 hours of mid data.

I have nothing to offer but the north star of a future Apache 2.0 TTS model, so prefer data that you *already have* and costs you *nothing extra* to send. Additionally, *all* the new data may be used to construct public, Apache 2.0 voicepacks, and if that arrangement doesn't work for you, no need to send any audio.

Last time I asked for horses; now I'm asking for unicorns. As of writing this post, I've currently got a few English & Chinese unicorns, but there is plenty of room in the stable. Find me over on Discord at rzvzn: https://discord.gg/QuGxSWBfQy
  • 1 reply
ยท
reacted to Pendrokar's post with โค๏ธ 11 days ago
view post
Post
3030
TTS: Added Kokoro v1, Parler Large, LlaSa 3B & MARS 6 TTS models to the Arena.
Pendrokar/TTS-Spaces-Arena

Also had added MaskGCT, GPT-SoVITS & OuteTTS a month ago. OuteTTS devs did say that is too early for it to be added to TTS Arenas.

Mars 5 does have a space with open weights models, but inference is way too slow (2 minutes+).
  • 2 replies
ยท
reacted to hexgrad's post with ๐Ÿ”ฅ 16 days ago
New activity in hexgrad/Kokoro-82M 16 days ago
New activity in hexgrad/Kokoro-82M about 1 month ago
reacted to hexgrad's post with ๐Ÿ”ฅ about 2 months ago
view post
Post
2850
๐Ÿ‡ฌ๐Ÿ‡ง Four British voices have joined hexgrad/Kokoro-82M (Apache TTS model): bf_emma, bf_isabella, bm_george, bm_lewis