marzinouri commited on
Commit
0ab091c
·
1 Parent(s): 189e9ce

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +52 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Iranian Azerbaijani NLP Models
2
+
3
+ ## Github Repository: [iranian-azerbaijani-nlp](https://github.com/language-ml/iranian-azerbaijani-nlp)
4
+
5
+ ## Overview
6
+
7
+ This model card provides information about the NLP models developed as part of the paper accepted for publication at AACL 2023. The models are designed to support Natural Language Processing (NLP) tasks for the Iranian Azerbaijani language (ISO code: azb). The models included in this repository are:
8
+
9
+ 1. **AzerBERT**
10
+ - Type: BERT-based language model transformer
11
+ - Description: AzerBERT is a pre-trained language model specifically tailored for the Iranian Azerbaijani language. It can be used for various NLP tasks, including text classification, named entity recognition, and more.
12
+ - Model Link: [AzerBERT Model](https://huggingface.co/language-ml-lab/AzerBert)
13
+
14
+ 2. **Language Model-based Embedding (FastText)**
15
+ - Type: FastText-based word embedding model
16
+ - Description: This model provides embeddings for Iranian Azerbaijani text using the FastText framework. It allows you to generate word embeddings for Iranian Azerbaijani words and phrases.
17
+ - Model Link: [FastText Embedding Model](https://huggingface.co/language-ml-lab/fasttext-azb)
18
+
19
+ 3. **Text Classification Model (Fine-tuned with AzerBERT)**
20
+ - Type: Fine-tuned BERT-based text classification model
21
+ - Description: This model has been fine-tuned using AzerBERT for text classification tasks. It is designed to categorize text into one of the following four categories: literature, sports, history, and geography.
22
+ - Model Link: [Text Classification Model](https://huggingface.co/language-ml-lab/classification-azb)
23
+
24
+ 4. **POS Tagger (Fine-tuned with AzerBERT)**
25
+ - Type: Fine-tuned BERT-based Part-of-Speech (POS) tagging model
26
+ - Description: This model has been fine-tuned using AzerBERT for part-of-speech tagging tasks in Iranian Azerbaijani text. It can be used to annotate text with 11 POS tags, which is essential for various downstream NLP applications.
27
+ - Model Link: [POS Tagger Model](https://huggingface.co/language-ml-lab/postagger-azb)
28
+
29
+ 5. **Translation Models (Persian to Azerbaijani and Vice Versa)**
30
+ - Type: Machine translation models
31
+ - Description: These models support translation between Persian (fa) and Iranian Azerbaijani (azb) languages. They enable bidirectional translation between the two languages, making them valuable for cross-language communication.
32
+ - Model Link: [Translation Models](https://huggingface.co/language-ml-lab/translation-azb-fa)
33
+
34
+ ## Model Training Data
35
+
36
+ The details about the training data used to pre-train and fine-tune these models can be found in the associated research paper. Please refer to the paper for comprehensive information about the data sources and preprocessing steps.
37
+
38
+ ## Evaluation Metrics
39
+
40
+ The evaluation metrics for each specific model, including accuracy, F1-score, BLEU score, or other relevant metrics, are provided in the associated research paper.
41
+
42
+ ## Acknowledgments
43
+
44
+ Please acknowledge the authors and cite the associated research paper when using these models in your work. Proper attribution helps recognize the effort and contributions of the researchers involved in model development.
45
+
46
+ ## Citation
47
+
48
+ If you use these models in your research or applications, please cite the following paper:
49
+
50
+ [Include the citation for your AACL 2023 paper here]
51
+
52
+ ---