HKUST Audio's picture

13 5 6

HKUST Audio PRO

HKUST-Audio

·

wxue_audio

AI & ML interests

Audio Generation

Recent Activity

updated a collection 2 days ago

Our AK Daily Papers

updated a dataset 3 days ago

HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized

new activity 3 days ago

HKUSTAudio/Llasa-3B:Speech Generation Works Sometimes, But Fails Randomly

View all activity

Organizations

HKUST-Audio's activity

upvoted a paper 3 days ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published 4 days ago • 15

upvoted a collection 10 days ago

Llasa

TTS foundation model compatible with Llama framework (160k hours tokenized speech data released) • 7 items • Updated 3 days ago • 5

upvoted an article 20 days ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

By

•

20 days ago

• 60

upvoted a paper about 1 month ago

Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

Paper • 2408.17175 • Published Aug 30, 2024 • 2

upvoted a paper 8 months ago

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 31