sagorsarker
commited on
Commit
•
5d3b81e
1
Parent(s):
1bb676c
Update README.md
Browse files
README.md
CHANGED
@@ -52,3 +52,30 @@ For training the model, the dataset we selected comprises 17.64k hours of news c
|
|
52 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/64df9253cccd823564c3303b/O2RA9TAedIv1OTqgdIap5.png)
|
53 |
|
54 |
## Citation
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
52 |
![image/png](https://cdn-uploads.huggingface.co/production/uploads/64df9253cccd823564c3303b/O2RA9TAedIv1OTqgdIap5.png)
|
53 |
|
54 |
## Citation
|
55 |
+
```
|
56 |
+
@inproceedings{nandi-etal-2023-pseudo,
|
57 |
+
title = "Pseudo-Labeling for Domain-Agnostic {B}angla Automatic Speech Recognition",
|
58 |
+
author = "Nandi, Rabindra Nath and
|
59 |
+
Menon, Mehadi and
|
60 |
+
Muntasir, Tareq and
|
61 |
+
Sarker, Sagor and
|
62 |
+
Muhtaseem, Quazi Sarwar and
|
63 |
+
Islam, Md. Tariqul and
|
64 |
+
Chowdhury, Shammur and
|
65 |
+
Alam, Firoj",
|
66 |
+
editor = "Alam, Firoj and
|
67 |
+
Kar, Sudipta and
|
68 |
+
Chowdhury, Shammur Absar and
|
69 |
+
Sadeque, Farig and
|
70 |
+
Amin, Ruhul",
|
71 |
+
booktitle = "Proceedings of the First Workshop on Bangla Language Processing (BLP-2023)",
|
72 |
+
month = dec,
|
73 |
+
year = "2023",
|
74 |
+
address = "Singapore",
|
75 |
+
publisher = "Association for Computational Linguistics",
|
76 |
+
url = "https://aclanthology.org/2023.banglalp-1.16",
|
77 |
+
doi = "10.18653/v1/2023.banglalp-1.16",
|
78 |
+
pages = "152--162",
|
79 |
+
abstract = "One of the major challenges for developing automatic speech recognition (ASR) for low-resource languages is the limited access to labeled data with domain-specific variations. In this study, we propose a pseudo-labeling approach to develop a large-scale domain-agnostic ASR dataset. With the proposed methodology, we developed a 20k+ hours labeled Bangla speech dataset covering diverse topics, speaking styles, dialects, noisy environments, and conversational scenarios. We then exploited the developed corpus to design a conformer-based ASR system. We benchmarked the trained ASR with publicly available datasets and compared it with other available models. To investigate the efficacy, we designed and developed a human-annotated domain-agnostic test set composed of news, telephony, and conversational data among others. Our results demonstrate the efficacy of the model trained on psuedo-label data for the designed test-set along with publicly-available Bangla datasets. The experimental resources will be publicly available.https://github.com/hishab-nlp/Pseudo-Labeling-for-Domain-Agnostic-Bangla-ASR",
|
80 |
+
}
|
81 |
+
```
|