poonehmousavi committed 1fe29cd (parent: c0583ea): Update README.md

README.md CHANGED (updated content, by hunk):
@@ -1,6 +1,6 @@
---
language:
- fa
thumbnail: null
pipeline_tag: automatic-speech-recognition
tags:
@@ -16,31 +16,31 @@ metrics:
- wer
- cer
model-index:
- name: asr-whisper-medium-commonvoice-fa
  results:
  - task:
      name: Automatic Speech Recognition
      type: automatic-speech-recognition
    dataset:
      name: CommonVoice 14.0 (Farsi)
      type: mozilla-foundation/common_voice_14_0
      config: fa
      split: test
      args:
        language: fa
    metrics:
    - name: Test WER
      type: wer
      value: '29.01'
---

<iframe src="https://ghbtns.com/github-btn.html?user=speechbrain&repo=speechbrain&type=star&count=true&size=medium" frameborder="0" scrolling="0" width="170" height="30" title="GitHub"></iframe>
<br/><br/>

# Whisper medium fine-tuned on CommonVoice-14.0 Farsi

This repository provides all the necessary tools to perform automatic speech
recognition from an end-to-end Whisper model fine-tuned on CommonVoice (Farsi) within
SpeechBrain. For a better experience, we encourage you to learn more about
[SpeechBrain](https://speechbrain.github.io).

@@ -48,7 +48,7 @@ The performance of the model is the following:

| Release | Test CER | Test WER | GPUs        |
|:-------:|:--------:|:--------:|:-----------:|
| 1-08-23 | 8.58     | 29.01    | 1xV100 32GB |

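For context on how such scores are obtained, the snippet below is a minimal sketch of scoring hypotheses against references with SpeechBrain's `ErrorRateStats` utility; CER is computed the same way at the character level. The utterance ID and the two Farsi sentences are invented placeholders, not samples from the CommonVoice test set.

```python
# Minimal sketch: computing a WER with SpeechBrain's ErrorRateStats.
# The utterance ID and tokens below are placeholders for illustration only.
from speechbrain.utils.metric_stats import ErrorRateStats

wer_stats = ErrorRateStats()
refs = [["این", "یک", "جمله", "است"]]   # reference words for one utterance
hyps = [["این", "یک", "جملات", "است"]]  # hypothesis words from the model
wer_stats.append(ids=["utt1"], predict=hyps, target=refs)
print(wer_stats.summarize("error_rate"))  # word error rate, in percent
```
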
## Pipeline description

@@ -72,14 +72,14 @@ pip install speechbrain transformers
Please note that we encourage you to read our tutorials and learn more about
[SpeechBrain](https://speechbrain.github.io).

### Transcribing your own audio files (in Farsi)

```python
from speechbrain.pretrained import WhisperASR

asr_model = WhisperASR.from_hparams(source="speechbrain/asr-whisper-medium-commonvoice-fa", savedir="pretrained_models/asr-whisper-medium-commonvoice-fa")
asr_model.transcribe_file("speechbrain/asr-whisper-medium-commonvoice-fa/example-fa.mp3")
```

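If a GPU is available, the same pretrained interface can be moved onto it through `run_opts` when loading the model. The snippet below is a small sketch of that usage; `my_farsi_audio.wav` is a hypothetical local file you would replace with your own recording.

```python
# Sketch: loading the model on GPU and transcribing a local recording.
# "my_farsi_audio.wav" is a placeholder path, not a file shipped with this repo.
from speechbrain.pretrained import WhisperASR

asr_model = WhisperASR.from_hparams(
    source="speechbrain/asr-whisper-medium-commonvoice-fa",
    savedir="pretrained_models/asr-whisper-medium-commonvoice-fa",
    run_opts={"device": "cuda"},  # remove this line to run on CPU
)
print(asr_model.transcribe_file("my_farsi_audio.wav"))
```
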
@@ -103,7 +103,7 @@ pip install -e .
3. Run Training:
```bash
cd recipes/CommonVoice/ASR/transformer/
python train_with_whisper.py hparams/train_fa_hf_whisper.yaml --data_folder=your_data_folder
```

You can find our training results (models, logs, etc.) [here](https://drive.google.com/drive/folders/11PKCsyIE703mmDv6n6n_UnD0bUgMPbg_?usp=share_link).