marcel
/

wav2vec2-large-xlsr-53-german

Automatic Speech Recognition

xlsr-fine-tuning-week

Inference Endpoints

Model card Files Files and versions Community

marcel commited on Mar 29, 2021

Commit

0c68cd2

•

1 Parent(s): 530b5f4

WER

Files changed (1) hide show

README.md +5 -6

README.md CHANGED Viewed

@@ -22,12 +22,12 @@ model-index:
     metrics:
        - name: Test WER
          type: wer
-         value: 15.91
 ---
 # Wav2Vec2-Large-XLSR-53-German
-Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on German using 12% of the [Common Voice](https://huggingface.co/datasets/common_voice) dataset.
 When using this model, make sure that your speech input is sampled at 16kHz.
 ## Usage
@@ -79,7 +79,7 @@ from datasets import load_dataset, load_metric
 from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor
 import re
-test_dataset = load_dataset("common_voice", "de", split="test[:10%]")
 wer = load_metric("wer")
 processor = Wav2Vec2Processor.from_pretrained("marcel/wav2vec2-large-xlsr-53-german")
@@ -140,11 +140,10 @@ result = test_dataset.map(evaluate, batched=True, batch_size=8)
 print("WER: {:2f}".format(100 * wer.compute(predictions=result["pred_strings"], references=result["sentence"])))
 ```
-**Test Result**: 15.91 %
 ## Training
-The first 12% of the Common Voice `train`, `validation` datasets were used for training.
-The script used for training can be found TODO

     metrics:
        - name: Test WER
          type: wer
+         value: 15.80
 ---
 # Wav2Vec2-Large-XLSR-53-German
+Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on German using the [Common Voice](https://huggingface.co/datasets/common_voice) dataset.
 When using this model, make sure that your speech input is sampled at 16kHz.
 ## Usage
 from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor
 import re
+test_dataset = load_dataset("common_voice", "de", split="test")
 wer = load_metric("wer")
 processor = Wav2Vec2Processor.from_pretrained("marcel/wav2vec2-large-xlsr-53-german")
 print("WER: {:2f}".format(100 * wer.compute(predictions=result["pred_strings"], references=result["sentence"])))
 ```
+**Test Result**: 15.80 %
 ## Training
+The first 50% of the Common Voice `train`, and 12% of the `validation` datasets were used for training (30 epochs on first 12% and 3 epochs on the remainder).