Automatic Speech Recognition
NeMo
PyTorch
4 languages
automatic-speech-translation
speech
audio
Transformer
FastConformer
Conformer
NeMo
hf-asr-leaderboard
Eval Results
krishnacpuvvada commited on
Commit
0b65545
1 Parent(s): e685eb9

Model card draft

Browse files
Files changed (1) hide show
  1. README.md +167 -0
README.md CHANGED
@@ -1,3 +1,170 @@
1
  ---
2
  license: cc-by-nc-4.0
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
3
  ---
 
1
  ---
2
  license: cc-by-nc-4.0
3
+ language:
4
+ - en
5
+ - de
6
+ - es
7
+ - fr
8
+ library_name: nemo
9
+ datasets:
10
+ - librispeech_asr
11
+ - fisher_corpus
12
+ - Switchboard-1
13
+ - WSJ-0
14
+ - WSJ-1
15
+ - National-Singapore-Corpus-Part-1
16
+ - National-Singapore-Corpus-Part-6
17
+ - vctk
18
+ - voxpopuli
19
+ - europarl
20
+ - multilingual_librispeech
21
+ - mozilla-foundation/common_voice_8_0
22
+ - MLCommons/peoples_speech
23
+ thumbnail: null
24
+ tags:
25
+ - automatic-speech-recognition
26
+ - speech
27
+ - audio
28
+ - Transducer
29
+ - FastConformer
30
+ - Conformer
31
+ - pytorch
32
+ - NeMo
33
+ - hf-asr-leaderboard
34
+ widget:
35
+ - example_title: Librispeech sample 1
36
+ src: https://cdn-media.huggingface.co/speech_samples/sample1.flac
37
+ - example_title: Librispeech sample 2
38
+ src: https://cdn-media.huggingface.co/speech_samples/sample2.flac
39
+ model-index:
40
+ - name: parakeet_rnnt_1.1b
41
+ results:
42
+ - task:
43
+ name: Automatic Speech Recognition
44
+ type: automatic-speech-recognition
45
+ dataset:
46
+ name: AMI (Meetings test)
47
+ type: edinburghcstr/ami
48
+ config: ihm
49
+ split: test
50
+ args:
51
+ language: en
52
+ metrics:
53
+ - name: Test WER
54
+ type: wer
55
+ value: 17.10
56
+ - task:
57
+ name: Automatic Speech Recognition
58
+ type: automatic-speech-recognition
59
+ dataset:
60
+ name: Earnings-22
61
+ type: revdotcom/earnings22
62
+ split: test
63
+ args:
64
+ language: en
65
+ metrics:
66
+ - name: Test WER
67
+ type: wer
68
+ value: 14.11
69
+ - task:
70
+ name: Automatic Speech Recognition
71
+ type: automatic-speech-recognition
72
+ dataset:
73
+ name: GigaSpeech
74
+ type: speechcolab/gigaspeech
75
+ split: test
76
+ args:
77
+ language: en
78
+ metrics:
79
+ - name: Test WER
80
+ type: wer
81
+ value: 9.96
82
+ - task:
83
+ name: Automatic Speech Recognition
84
+ type: automatic-speech-recognition
85
+ dataset:
86
+ name: LibriSpeech (clean)
87
+ type: librispeech_asr
88
+ config: other
89
+ split: test
90
+ args:
91
+ language: en
92
+ metrics:
93
+ - name: Test WER
94
+ type: wer
95
+ value: 1.46
96
+ - task:
97
+ name: Automatic Speech Recognition
98
+ type: automatic-speech-recognition
99
+ dataset:
100
+ name: LibriSpeech (other)
101
+ type: librispeech_asr
102
+ config: other
103
+ split: test
104
+ args:
105
+ language: en
106
+ metrics:
107
+ - name: Test WER
108
+ type: wer
109
+ value: 2.47
110
+ - task:
111
+ type: Automatic Speech Recognition
112
+ name: automatic-speech-recognition
113
+ dataset:
114
+ name: SPGI Speech
115
+ type: kensho/spgispeech
116
+ config: test
117
+ split: test
118
+ args:
119
+ language: en
120
+ metrics:
121
+ - name: Test WER
122
+ type: wer
123
+ value: 3.11
124
+ - task:
125
+ type: Automatic Speech Recognition
126
+ name: automatic-speech-recognition
127
+ dataset:
128
+ name: tedlium-v3
129
+ type: LIUM/tedlium
130
+ config: release1
131
+ split: test
132
+ args:
133
+ language: en
134
+ metrics:
135
+ - name: Test WER
136
+ type: wer
137
+ value: 3.92
138
+ - task:
139
+ name: Automatic Speech Recognition
140
+ type: automatic-speech-recognition
141
+ dataset:
142
+ name: Vox Populi
143
+ type: facebook/voxpopuli
144
+ config: en
145
+ split: test
146
+ args:
147
+ language: en
148
+ metrics:
149
+ - name: Test WER
150
+ type: wer
151
+ value: 5.39
152
+ - task:
153
+ type: Automatic Speech Recognition
154
+ name: automatic-speech-recognition
155
+ dataset:
156
+ name: Mozilla Common Voice 9.0
157
+ type: mozilla-foundation/common_voice_9_0
158
+ config: en
159
+ split: test
160
+ args:
161
+ language: en
162
+ metrics:
163
+ - name: Test WER
164
+ type: wer
165
+ value: 5.79
166
+
167
+ metrics:
168
+ - wer
169
+ pipeline_tag: automatic-speech-recognition
170
  ---