Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond Paper • 2408.03900 • Published Aug 7 • 9
Building and better understanding vision-language models: insights and future directions Paper • 2408.12637 • Published Aug 22 • 110
Vietnamese speech dataset Collection for speech-related tasks: speech-to-text & text-to-speech • 24 items • Updated Jul 8 • 6