Update README.md
Browse files
README.md
CHANGED
@@ -29,7 +29,7 @@ model-index:
|
|
29 |
value: 21.9
|
30 |
---
|
31 |
# Wav2Vec2-Large-Japanese
|
32 |
-
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on Japanese using the [Common Voice](https://huggingface.co/datasets/common_voice),
|
33 |
|
34 |
When using this model, make sure that your speech input is sampled at 16kHz.
|
35 |
|
@@ -124,10 +124,11 @@ references = [x.upper() for x in result["sentence"]]
|
|
124 |
print(f"WER: {wer.compute(predictions=predictions, references=references, chunk_size=1000) * 100}")
|
125 |
print(f"CER: {cer.compute(predictions=predictions, references=references, chunk_size=1000) * 100}")
|
126 |
```
|
|
|
127 |
**Test Result**:
|
128 |
In the table below I report the Word Error Rate (WER) and the Character Error Rate (CER) of the model. I ran the evaluation script described above on other models as well (on 2021-05-10). Note that the table below may show different results from those already reported, this may have been caused due to some specificity of the other evaluation scripts used.
|
129 |
| Model | WER | CER |
|
130 |
| ------------- | ------------- | ------------- |
|
131 |
-
| jonatasgrosman/wav2vec2-large-xlsr-53-japanese | **81.
|
132 |
| vumichien/wav2vec2-large-xlsr-japanese | 1108.86% | 23.40% |
|
133 |
| qqhann/w2v_hf_jsut_xlsr53 | 1012.18% | 70.77% |
|
|
|
29 |
value: 21.9
|
30 |
---
|
31 |
# Wav2Vec2-Large-Japanese
|
32 |
+
Fine-tuned [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on Japanese using the [Common Voice](https://huggingface.co/datasets/common_voice),[JSUT](https://sites.google.com/site/shinnosuketakamichi/publication/jsut), [TEDxJP](https://github.com/laboroai/TEDxJP-10K) and some other data.
|
33 |
|
34 |
When using this model, make sure that your speech input is sampled at 16kHz.
|
35 |
|
|
|
124 |
print(f"WER: {wer.compute(predictions=predictions, references=references, chunk_size=1000) * 100}")
|
125 |
print(f"CER: {cer.compute(predictions=predictions, references=references, chunk_size=1000) * 100}")
|
126 |
```
|
127 |
+
|
128 |
**Test Result**:
|
129 |
In the table below I report the Word Error Rate (WER) and the Character Error Rate (CER) of the model. I ran the evaluation script described above on other models as well (on 2021-05-10). Note that the table below may show different results from those already reported, this may have been caused due to some specificity of the other evaluation scripts used.
|
130 |
| Model | WER | CER |
|
131 |
| ------------- | ------------- | ------------- |
|
132 |
+
| jonatasgrosman/wav2vec2-large-xlsr-53-japanese | **81.30%** | **21.9%** |
|
133 |
| vumichien/wav2vec2-large-xlsr-japanese | 1108.86% | 23.40% |
|
134 |
| qqhann/w2v_hf_jsut_xlsr53 | 1012.18% | 70.77% |
|