---
license: apache-2.0
datasets:
- mozilla-foundation/common_voice_12_0
language:
- fr
metrics:
- wer
pipeline_tag: automatic-speech-recognition
---

Model trained on the full Common Voice dataset (French, `mozilla-foundation/common_voice_12_0`).

The WERs are:

| decoding method      | chunk size | test WER (%) |        comment       |     decoding mode    |
| -------------------- | ---------- | ------------ | -------------------- | -------------------- |
| greedy search        | 640ms      | 10.90        | --epoch 30 --avg 9   | simulated streaming  |
| modified beam search | 640ms      | 10.55        | --epoch 30 --avg 9   | simulated streaming  |
| fast beam search     | 640ms      | 10.75        | --epoch 30 --avg 9   | simulated streaming  |
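
For readers unfamiliar with the metric, the snippet below is a minimal sketch of how a word error rate can be computed: word-level edit distance divided by the number of reference words, expressed as a percentage. It is illustrative only; the function name `wer` and the whitespace tokenization are assumptions, and this is not the scoring script that produced the numbers in this card.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate in percent, using whitespace tokenization (an assumption)."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return 100.0 * prev[-1] / len(ref)


if __name__ == "__main__":
    # One substitution ("chat" -> "chien") and one deletion ("le") over 6 reference words.
    print(f"{wer('le chat dort sur le lit', 'le chien dort sur lit'):.2f}")
```

Real evaluations usually also normalize text (case, punctuation) before scoring, which this sketch leaves out.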

Model trained on full LibriSpeech, then fine-tuned on full Common Voice.

The WERs are:

| decoding method      | chunk size | test WER (%) |        comment       |     decoding mode    |
| -------------------- | ---------- | ------------ | -------------------- | -------------------- |
| greedy search        | 640ms      | 10.57        | --epoch 29 --avg 9   | simulated streaming  |
| modified beam search | 640ms      | 10.19        | --epoch 29 --avg 9   | simulated streaming  |
| fast beam search     | 640ms      | 10.25        | --epoch 29 --avg 9   | simulated streaming  |
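
To compare the two training setups, the short sketch below computes the relative WER reduction per decoding method. The WER values are copied from the two tables in this card; the helper name `relative_reduction` and the dictionary names are introduced here only for illustration.

```python
# Test WERs copied from the two tables in this card.
common_voice_only = {
    "greedy search": 10.90,
    "modified beam search": 10.55,
    "fast beam search": 10.75,
}
librispeech_then_finetune = {
    "greedy search": 10.57,
    "modified beam search": 10.19,
    "fast beam search": 10.25,
}


def relative_reduction(before: float, after: float) -> float:
    """Relative WER reduction in percent, going from `before` to `after`."""
    return 100.0 * (before - after) / before


if __name__ == "__main__":
    for method, before in common_voice_only.items():
        after = librispeech_then_finetune[method]
        print(f"{method}: {before:.2f} -> {after:.2f} "
              f"({relative_reduction(before, after):.1f}% relative reduction)")
```

The fine-tuned model has the lower WER for all three decoding methods, and modified beam search gives the lowest WER in both setups.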