arxiv:2407.14329
Xuenan Xu
wsntxxn
AI & ML interests
Text to Speech Synthesis
Text to Music Synthesis
Singing Voice Synthesis
Recent Activity
new activity
12 days ago
wsntxxn/cnn8rnn-audioset-sed:Adding `safetensors` variant of this model
new activity
15 days ago
wsntxxn/cnn14rnn-tempgru-audiocaps-captioning:Adding `safetensors` variant of this model
new activity
22 days ago
wsntxxn/effb2-trm-audiocaps-captioning:Adding `safetensors` variant of this model
Organizations
None yet
Papers
10
models
7
wsntxxn/cnn8rnn-audioset-sed
Audio Classification
•
Updated
•
268
•
2
wsntxxn/cnn14rnn-tempgru-audiocaps-captioning
Feature Extraction
•
Updated
•
169
•
1
wsntxxn/effb2-trm-audiocaps-captioning
Feature Extraction
•
Updated
•
138
•
1
wsntxxn/effb2-trm-clotho-captioning
Feature Extraction
•
Updated
•
151
•
1
wsntxxn/cnn8rnn-w2vmean-audiocaps-grounding
Audio Classification
•
Updated
•
111
•
2
wsntxxn/audiocaps-simple-tokenizer
Updated
wsntxxn/clotho-simple-tokenizer
Updated
datasets
None public yet