--- language: - sw license: apache-2.0 tags: - hf-asr-leaderboard - generated_from_trainer datasets: - mozilla-foundation/common_voice_11_0 metrics: - wer model-index: - name: whisper-large-v2-sw results: - task: name: Automatic Speech Recognition type: automatic-speech-recognition dataset: name: Common Voice 11.0 type: mozilla-foundation/common_voice_11_0 config: sw split: test args: 'config: sw, split: test' metrics: - name: Wer type: wer value: 30.7 --- ## Model * Name: Whisper Large-v2 Swahili * Description: Whisper weights for speech-to-text task, fine-tuned and evaluated on normalized data. * Dataset: - Train and validation splits for Swahili subsets of [Common Voice 11.0](https://huggingface.co/datasets/mozilla-foundation/common_voice_11_0). - Train, validation and test splits for Swahili subsets of [Google Fleurs](https://huggingface.co/datasets/google/fleurs/). * Performance: **30.7 WER** ## Weights * Date of release: 12.09.2022 * License: MIT ## Usage To use these weights in HuggingFace's `transformers` library, you can do the following: ```python from transformers import WhisperForConditionalGeneration model = WhisperForConditionalGeneration.from_pretrained("hedronstone/whisper-large-v2-sw") ```