README.md · shaojieli/icefall-asr-commonvoice-fr-pruned-transducer-stateless7-streaming-2023-04-02 at b1514287ac57ea7f79e47e9de5df818af4dac140

metadata

license: apache-2.0
datasets:
  - mozilla-foundation/common_voice_12_0
language:
  - fr
metrics:
  - wer
pipeline_tag: automatic-speech-recognition

training on full commonvoice

The WERs are:

decoding method	chunk size	test	comment	decoding mode
greedy search	640ms	10.90	--epoch 30 --avg 9	simulated streaming
modified beam search	640ms	10.55	--epoch 30 --avg 9	simulated streaming
fast beam search	640ms	10.75	--epoch 30 --avg 9	simulated streaming

training on full librispeech then finetune on full commonvoice

The WERs are:

decoding method	chunk size	test	comment	decoding mode
greedy search	640ms	10.57	--epoch 29 --avg 9	simulated streaming
modified beam search	640ms	10.19	--epoch 29 --avg 9	simulated streaming
fast beam search	640ms	10.25	--epoch 29 --avg 9	simulated streaming

training on full librispeech and gigaspeech then finetune on full commonvoice

The WERs are:

decoding method	chunk size	test	comment	decoding mode
greedy search	640ms	9.95	--epoch 30 --avg 9	simulated streaming
modified beam search	640ms	9.57	--epoch 30 --avg 9	simulated streaming
fast beam search	640ms	9.67	--epoch 30 --avg 9	simulated streaming