Audio Entailment: Assessing Deductive Reasoning for Audio Understanding Paper • 2407.18062 • Published Jul 25, 2024
ML-SUPERB: Multilingual Speech Universal PERformance Benchmark Paper • 2305.10615 • Published May 18, 2023 • 1
Joint Prediction and Denoising for Large-scale Multilingual Self-supervised Learning Paper • 2309.15317 • Published Sep 26, 2023
Reproducing Whisper-Style Training Using an Open-Source Toolkit and Publicly Available Data Paper • 2309.13876 • Published Sep 25, 2023 • 1
Improving Massively Multilingual ASR With Auxiliary CTC Objectives Paper • 2302.12829 • Published Feb 24, 2023
OWSM v3.1: Better and Faster Open Whisper-Style Speech Models based on E-Branchformer Paper • 2401.16658 • Published Jan 30, 2024 • 13
cmu-mlsp/librispeech960-wavlm-large-km1000_asr_tokenized_final_fixed Viewer • Updated Dec 7, 2023 • 576k • 109
cmu-mlsp/librispeech960-wavlm-large-km1000_asr_tokenized_final Viewer • Updated Dec 5, 2023 • 295k • 33
cmu-mlsp/hubert_layer9-librispeech-asr100h_tokenized_final_asr Viewer • Updated Nov 30, 2023 • 39.2k • 31
cmu-mlsp/encodec_24khz-opt-125m-lm_pretraining_ls960_1qt-librispeech_asr-test.clean-features Viewer • Updated Nov 10, 2023 • 2.62k • 38
cmu-mlsp/encodec_24khz-opt-125m-lm_pretraining_ls960_1qt-librispeech_asr-train.clean.100-features Viewer • Updated Nov 10, 2023 • 10 • 29