SKostandian
commited on
Commit
•
c4521cd
1
Parent(s):
bc37aee
Update README.md
Browse files
README.md
CHANGED
@@ -188,7 +188,7 @@ The tokenizer for these model was built using the text transcripts of the train
|
|
188 |
The model is trained on composite dataset comprising of around 8500 hours of English speech:
|
189 |
|
190 |
- [Librispeech](https://www.openslr.org/12)
|
191 |
-
- Data Collection Method:
|
192 |
- Labeling Method: by Human
|
193 |
- [Mozilla Common Voice 11.0 English](https://commonvoice.mozilla.org/en/datasets)
|
194 |
- Data Collection Method: by Human
|
@@ -197,10 +197,10 @@ The model is trained on composite dataset comprising of around 8500 hours of Eng
|
|
197 |
- Data Collection Method: by Human
|
198 |
- Labeling Method: by Human
|
199 |
- [Fisher](https://catalog.ldc.upenn.edu/LDC2004S13)
|
200 |
-
- Data Collection Method:
|
201 |
- Labeling Method: by Human
|
202 |
- [MLS](https://www.openslr.org/94/)
|
203 |
-
- Data Collection Method:
|
204 |
- Labeling Method: by Human
|
205 |
- [Voxpopuli](https://github.com/facebookresearch/voxpopuli)
|
206 |
- Data Collection Method: by Human
|
|
|
188 |
The model is trained on composite dataset comprising of around 8500 hours of English speech:
|
189 |
|
190 |
- [Librispeech](https://www.openslr.org/12)
|
191 |
+
- Data Collection Method: Automated
|
192 |
- Labeling Method: by Human
|
193 |
- [Mozilla Common Voice 11.0 English](https://commonvoice.mozilla.org/en/datasets)
|
194 |
- Data Collection Method: by Human
|
|
|
197 |
- Data Collection Method: by Human
|
198 |
- Labeling Method: by Human
|
199 |
- [Fisher](https://catalog.ldc.upenn.edu/LDC2004S13)
|
200 |
+
- Data Collection Method: Automated
|
201 |
- Labeling Method: by Human
|
202 |
- [MLS](https://www.openslr.org/94/)
|
203 |
+
- Data Collection Method: Automated
|
204 |
- Labeling Method: by Human
|
205 |
- [Voxpopuli](https://github.com/facebookresearch/voxpopuli)
|
206 |
- Data Collection Method: by Human
|