alvanlii
/

whisper-small-cantonese

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

alvanlii commited on Feb 22

Commit

2e5181f

•

1 Parent(s): c8c2a58

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -33,6 +33,9 @@ This model is a fine-tuned version of [openai/whisper-small](https://huggingface
 ## Training and evaluation data
 For training,
 |Name|# of Hours|
 |--|--|
 |Common Voice 16.0 zh-HK Train|138|
@@ -42,8 +45,6 @@ For training,
 |Pseudo-Labelled YouTube Data|438|
 |Total|756|
-- CantoMap: Winterstein, Grégoire, Tang, Carmen and Lai, Regine (2020) "CantoMap: a Hong Kong Cantonese MapTask Corpus", in Proceedings of The 12th Language Resources and Evaluation Conference, Marseille: European Language Resources Association, p. 2899-2906.
-- Cantonse-ASR: Yu, Tiezheng, Frieske, Rita, Xu, Peng, Cahyawijaya, Samuel, Yiu, Cheuk Tung, Lovenia, Holy, Dai, Wenliang, Barezi, Elham, Chen, Qifeng, Ma, Xiaojuan, Shi, Bertram, Fung, Pascale (2022) "Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset", 2022. Link: https://arxiv.org/pdf/2201.02419.pdf
 For evaluation, Common Voice 16.0 yue Test set is used.

 ## Training and evaluation data
 For training,
+- CantoMap: Winterstein, Grégoire, Tang, Carmen and Lai, Regine (2020) "CantoMap: a Hong Kong Cantonese MapTask Corpus", in Proceedings of The 12th Language Resources and Evaluation Conference, Marseille: European Language Resources Association, p. 2899-2906.
+- Cantonse-ASR: Yu, Tiezheng, Frieske, Rita, Xu, Peng, Cahyawijaya, Samuel, Yiu, Cheuk Tung, Lovenia, Holy, Dai, Wenliang, Barezi, Elham, Chen, Qifeng, Ma, Xiaojuan, Shi, Bertram, Fung, Pascale (2022) "Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset", 2022. Link: https://arxiv.org/pdf/2201.02419.pdf
 |Name|# of Hours|
 |--|--|
 |Common Voice 16.0 zh-HK Train|138|
 |Pseudo-Labelled YouTube Data|438|
 |Total|756|
 For evaluation, Common Voice 16.0 yue Test set is used.