sagorsarker commited on
Commit
5d3b81e
1 Parent(s): 1bb676c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -0
README.md CHANGED
@@ -52,3 +52,30 @@ For training the model, the dataset we selected comprises 17.64k hours of news c
52
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64df9253cccd823564c3303b/O2RA9TAedIv1OTqgdIap5.png)
53
 
54
  ## Citation
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
52
  ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64df9253cccd823564c3303b/O2RA9TAedIv1OTqgdIap5.png)
53
 
54
  ## Citation
55
+ ```
56
+ @inproceedings{nandi-etal-2023-pseudo,
57
+ title = "Pseudo-Labeling for Domain-Agnostic {B}angla Automatic Speech Recognition",
58
+ author = "Nandi, Rabindra Nath and
59
+ Menon, Mehadi and
60
+ Muntasir, Tareq and
61
+ Sarker, Sagor and
62
+ Muhtaseem, Quazi Sarwar and
63
+ Islam, Md. Tariqul and
64
+ Chowdhury, Shammur and
65
+ Alam, Firoj",
66
+ editor = "Alam, Firoj and
67
+ Kar, Sudipta and
68
+ Chowdhury, Shammur Absar and
69
+ Sadeque, Farig and
70
+ Amin, Ruhul",
71
+ booktitle = "Proceedings of the First Workshop on Bangla Language Processing (BLP-2023)",
72
+ month = dec,
73
+ year = "2023",
74
+ address = "Singapore",
75
+ publisher = "Association for Computational Linguistics",
76
+ url = "https://aclanthology.org/2023.banglalp-1.16",
77
+ doi = "10.18653/v1/2023.banglalp-1.16",
78
+ pages = "152--162",
79
+ abstract = "One of the major challenges for developing automatic speech recognition (ASR) for low-resource languages is the limited access to labeled data with domain-specific variations. In this study, we propose a pseudo-labeling approach to develop a large-scale domain-agnostic ASR dataset. With the proposed methodology, we developed a 20k+ hours labeled Bangla speech dataset covering diverse topics, speaking styles, dialects, noisy environments, and conversational scenarios. We then exploited the developed corpus to design a conformer-based ASR system. We benchmarked the trained ASR with publicly available datasets and compared it with other available models. To investigate the efficacy, we designed and developed a human-annotated domain-agnostic test set composed of news, telephony, and conversational data among others. Our results demonstrate the efficacy of the model trained on psuedo-label data for the designed test-set along with publicly-available Bangla datasets. The experimental resources will be publicly available.https://github.com/hishab-nlp/Pseudo-Labeling-for-Domain-Agnostic-Bangla-ASR",
80
+ }
81
+ ```