Update README.md
A couple minor edits to performance section.
README.md
CHANGED
@@ -1,10 +1,10 @@
----
-license: apache-2.0
-language:
-- multilingual
-tags:
-- automatic-speech-recognition
----
+---
+license: apache-2.0
+language:
+- multilingual
+tags:
+- automatic-speech-recognition
+---
 
 # reazonspeech-k2-v2-ja-en
 
@@ -61,7 +61,7 @@ Character Error Rates (CERs) for Japanese listed below:
 | greedy search | 12.56 | 6.93 | 9.75 | 9.67 |
 | modified beam search | 11.59 | 6.97 | 9.55 | 9.51 |
 
-Additional tests were performed with manually procurred audio files.
+Additional tests were performed with manually procurred audio files (see test_wavs/transcripts.txt).
 The model performs reasonably well as long as the input audio contains a single language.
 However when multiple languages are included in the same input, the model struggles to provide an accurate transcription (see test_multi).
 This result can be avoided by properly segmenting audio into chunks, separated by pauses in speech.
@@ -70,7 +70,7 @@ This result can be avoided by properly segmenting audio into chunks, separated b
 - test_ja_2: 26% (CER)
 - test_multi: 99% (CER)
 - test_en_1: 12% (WER)
-- test_en_2: 
+- test_en_2: 27% (WER)
 
 
 ## License
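
The README reports CER for the Japanese test files and WER for the English ones, but this diff does not show how those percentages were computed. As a minimal sketch, assuming the usual definition (Levenshtein edit distance between reference and hypothesis, divided by the reference length), the two metrics could be reproduced as below. The function names and the normalization choices (whitespace stripped for CER, lowercasing and whitespace tokenization for WER) are illustrative assumptions, not the author's evaluation script.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (unit cost per edit)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete a reference unit
                curr[j - 1] + 1,     # insert a hypothesis unit
                prev[j - 1] + cost,  # substitute, or match at no cost
            ))
        prev = curr
    return prev[-1]


def error_rate(ref_units, hyp_units):
    """Edit distance normalized by the number of reference units."""
    if not ref_units:
        raise ValueError("reference must not be empty")
    return edit_distance(ref_units, hyp_units) / len(ref_units)


def cer(reference, hypothesis):
    """Character error rate; whitespace is dropped first (assumption)."""
    ref_chars = list("".join(reference.split()))
    hyp_chars = list("".join(hypothesis.split()))
    return error_rate(ref_chars, hyp_chars)


def wer(reference, hypothesis):
    """Word error rate; lowercased and split on whitespace (assumption)."""
    return error_rate(reference.lower().split(), hypothesis.lower().split())


if __name__ == "__main__":
    # One substituted character out of five.
    print(f"CER: {cer('こんにちは', 'こんにちわ'):.0%}")
    # One dropped word out of six.
    print(f"WER: {wer('the cat sat on the mat', 'the cat sat on mat'):.0%}")
```

Because the rate is normalized by the reference length only, a hypothesis with many inserted units can push it to or past 100%, which is one way a result like the 99% CER for test_multi can arise.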