voidful/wav2vec2-large-xlsr-53-hk

patrickvonplaten committed on Mar 22, 2021

Commit fe26a2e • 1 Parent(s): 97cd4e4

Update README.md

Files changed (1)

README.md +16 -2

@@ -3,11 +3,25 @@ language: zh
 datasets:
 - common_voice
 tags:
-- speech
 - audio
 - automatic-speech-recognition
+- speech
 - xlsr-fine-tuning-week
 license: apache-2.0
+model-index:
+- name: XLSR Wav2Vec2 Chinese (Hong Kong) by Voidful
+  results:
+  - task:
+      name: Speech Recognition
+      type: automatic-speech-recognition
+    dataset:
+      name: Common Voice zh-HK
+      type: common_voice
+      args: zh-HK
+    metrics:
+       - name: Test CER
+         type: cer
+         value: 76.57
 ---
 ## Colab trial with recording or voice file
@@ -80,7 +94,7 @@ chars_to_ignore_regex = r"[¥•＂＃＄％＆＇（）＊＋，－／：；＜
 model = Wav2Vec2ForCTC.from_pretrained(model_name).to(device)
 processor = Wav2Vec2Processor.from_pretrained(processor_name)
-ds = load_dataset("common_voice", 'zh-HK', data_dir="./cv-corpus-6.1-2020-12-11", split="test")
+ds = load_dataset("common_voice", 'zh-HK', split="test")
 resampler = torchaudio.transforms.Resample(orig_freq=48_000, new_freq=16_000)
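
The `Test CER` entry added to the model-index metadata is a character error rate. As a rough sketch of how such a metric is commonly computed (not necessarily the evaluation script behind the value in this commit), the snippet below assumes CER is the total character-level Levenshtein distance between predictions and references, divided by the total number of reference characters and expressed as a percentage. The function names `edit_distance` and `cer` are illustrative and do not come from the README.

```python
from typing import Sequence


def edit_distance(ref: str, hyp: str) -> int:
    """Character-level Levenshtein distance between ref and hyp."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference character
                curr[j - 1] + 1,     # insert a hypothesis character
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def cer(references: Sequence[str], hypotheses: Sequence[str]) -> float:
    """Corpus-level character error rate as a percentage (assumed definition)."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have the same length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(r, h) for r, h in zip(references, hypotheses))
    return 100.0 * total_edits / total_chars
```

Aggregating edits over the whole corpus before dividing weights long utterances more heavily than averaging per-utterance rates would; which convention was used for the reported figure is not stated in this diff.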