Shivam Mehta

shivammehta25

http://www.shivammehta.me

AI & ML interests

Speech, Audio, LLM, Flow Matching, Diffusion, Flows, HMMs

Recent Activity

liked a Space about 1 month ago

kyutai/hibiki-samples

upvoted a paper 6 months ago

EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer

upvoted a paper 9 months ago

XTTS: a Massively Multilingual Zero-Shot Text-to-Speech Model

Papers 7

arxiv:2404.19622

arxiv:2309.05455

arxiv:2309.03199

arxiv:2306.09417

Spaces 2

Matcha TTS

Generate speech from text input

Diff-TTSG: Denoising probabilistic integrated speech and gesture synthesis

Models 2

shivammehta25/sd-class-butterflies-32

Unconditional Image Generation • Updated Nov 29, 2022 • 16

shivammehta25/Neural-HMM

Updated Nov 21, 2021

Datasets

None public yet