File size: 8,168 Bytes
dfe420b
 
 
 
 
 
 
 
56d0c46
 
 
dfe420b
 
 
 
 
 
 
 
 
 
 
 
 
ca1894b
 
 
 
 
e324b85
dfe420b
0cd61be
 
 
e5a2413
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8d9ea38
e5a2413
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
dfe420b
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
---
language:
- tl
library_name: transformers
pipeline_tag: token-classification
tags:
- language
- grammar
widget:
- text: "Ang bahay ay lumiliwanag na para bang may kumikislap sa bintana"
- text: "Naisip ko na kumain na lang tayo sa pinakasikat na restaurant sa Manila"
---
# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->

This is a BERT Tagalog Base Uncased Part of Speech tagger fine-tuned model of [Jiang et. al.'s pre-trained bert-tagalog-base-uncased model](https://huggingface.co/GKLMIP/bert-tagalog-base-uncased).

## Model Details

### Model Description

<!-- Provide a longer summary of what this model is. -->

- **Developed by:** syke9p3, mnemoria, xenoxia, riakm
- **Shared by:** syke9p3
- **Model type:** BERT Tagalog Base Uncased
- **Languages (NLP):** Tagalog, Filipino
- **Dataset:** Sagum et. al.'s annotated Tagalog Corpora based on MGNN Tagset convention. This model was trained in 800 sentences and evaluated with 200 sentences.
- **Finetuned from model**: [Jiang et. al.'s pre-trained bert-tagalog-base-uncased model](https://huggingface.co/GKLMIP/bert-tagalog-base-uncased)

## GitHub Link
[syke9p3/bert-tagalog-pos-tagger](https://github.com/syke9p3/bert-tagalog-pos-tagger)

### Tags

  | Part of Speech                                 | Tags |
  |-----------------------------------------------|------|
  | **Noun**                                   | NNC  |
  | Common Noun                                   | NNC  |
  | Proper Noun                                   | NNP  |
  | Proper Noun Abbreviation                      | NNPA |
  | Common Noun Abbreviation                      | NNCA |
  | **Pronoun**                                       | PR   |
  | as Subject (Palagyo)/Personal Pronouns Singular | PRS  |
  | Personal Pronouns                             | PRP  |
  | Possessive Subject (Paari)                    | PRSP |
  | Pointing to an Object Demonstrative/(Paturol/Pamatlig) | PRO  |
  | Question/Interrogative (Pananong)/Singular    | PRQ  |
  | Question/Interrogative Plural                 | PRQP |
  | Location (Panlunan)                           | PRL  |
  | Comparison (Panulad)                          | PRC  |
  | Found (Pahimaton)                             | PRF  |
  | Indefinite                                    | PRI  |
  | **Determiner**       | DT   |
  | Determiner (Pantukoy) for Common Noun Plural  | DTC  |
  | Determiner (Pantukoy) for Proper Noun         | DTP  |
  | Determiner (Pantukoy) for Proper Noun Plural  | DTPP |
  | Lexical Marker                                | LM   |
  | Conjunctions (Pang-ugnay)                     | CC, CCT, CCR, CCB, CCA |
  | Ligatures (Pang-angkop)                       | CCP  |
  | Preposition (Pang-ukol)                       | CCU  |
  | **Verb (Pandiwa)**                                | VB   |
  | Neutral/Infinitive                            | VBW  |
  | Auxiliary, Modal/Pseudo-verbs                 | VBS  |
  | Existential                                  | VBH  |
  | Non-existential                              | VBN  |
  | Time Past (Perfective)                       | VBTS |
  | Time Present (Imperfective)                  | VBTR |
  | Time Future (Contemplative)                  | VBTF |
  | Recent past                                  | VBTP |
  | Actor Focus                                  | VBAF |
  | Object/Goal Focus                            | VBOF |
  | Benefactive Focus                            | VBOB |
  | Locative Focus                               | VBOL |
  | Instrumental Focus                           | VBOI |
  | Referential/Measurement Focus                | VBRF |
  | **Adjective (Pang-uri)**                         | JJ   |
  | Describing (Panlarawan)                      | JJD  |
  | Used for Comparison (same level) (Pahambing Magkatulad) | JJC  |
  | Comparison Comparative (more) (Palamang)     | JJCC |
  | Comparison Superlative (most) (Pasukdol)      | JJCS |
  | Comparison Negation (not quite) (Di-Magkatulad) | JJCN |
  | Describing Number (Pamilang)                 | JJN  |
  | **Adverb (Pang-Abay)**                           | RB   |
  | Describing “How” (Pamaraan)                  | RBD  |
  | Number (Panggaano/Panukat)                   | RBN  |
  | Conditional (Kondisyunal)                   | RBK  |
  | Causative (Pananhi)                          | RBP  |
  | Benefactive (Benepaktibo)                    | RBB  |
  | Referential (Pangkaukulan)                   | RBR  |
  | Question (Pananong)                          | RBQ  |
  | Agree (Panang-ayon)                          | RBT  |
  | Disagree (Pananggi)                          | RBF  |
  | Frequency (Pamanahon)                        | RBW  |
  | Possibility (Pang-agam)                      | RBM  |
  | Place (Panlunan)                             | RBL  |
  | Enclitics (Paningit)                         | RBI  |
  | Interjections (Sambitla)                     | RBJ  |
  | Social Formula (Pormularyong Panlipunan)     | RBS  |
  |**Cardinal Number (Bilang)**                    | CD   |
  | Digit, Rank, Count                           | CDB  |
  | **Topicless (Walang Paksa)**                     | TS   |
  | Foreign Words                                | FW   |
  | **Punctuation (Pananda)**                        | PM   |
  | Period                                       | PMP  |
  | Exclamation Point                            | PME  |
  | Question Mark                                | PMQ  |
  | Comma                                        | PMC  |
  | Semi-colon                                   | PMSC |
  | **Symbols**                                      | PMS  |
  
  
  | Compound Tags  |  \<tag1>\<tag2>...\<tagN>      |  |   |     |
  |------------|---------------|----------|------------|------------|
  | CCB_CCP    | JJCS_VBRF_CCP | PRI_CCT  | RBL_JJD    | VBS_CCP    |
  | CCR_CCA    | JJCS_VBTR     | PRI_LM   | RBL_JJD_CCP| VBTF_CCP   |
  | CCR_CCB    | JJCS_VBTR_VBOF| PRL_CCP  | RBL_NNC    | VBTF_JJD   |
  | CCR_CCP    | JJCS_VBTR_VBRF| PRL_LM   | RBL_NNP    | VBTF_VBAF  |
  | CCR_LM     | JJCS_VBTS     | PRO_CCB  | RBL_NNPA   | VBTF_VBOB  |
  | CCT_CCA    | JJCS_VBW      | PRO_CCP  | RBL_NNP_NNP| VBTF_VBOF  |
  | CCT_CCP    | JJC_CCB       | PRO_LM   | RBL_PRL    | VBTF_VBOF_CCP |
  | CCT_LM     | JJC_CCP       | PRP_CCB  | RBM_CCP    | VBTR_CCP   |
  | CCU_DTP    | JJC_JJD       | PRP_CCP  | RBM_LM     | VBTR_VBAF  |
  | CDB_CCA    | JJC_PRL       | PRP_LM   | RBN_CCP    | VBTR_VBAF_CCP |
  | CDB_CCP    | JJD_CCA       | PRQ_CCP  | RBP_CCP    | VBTR_VBOB  |
  | CDB_LM     | JJD_CCB       | PRQ_LM   | RBQ_CCB    | VBTR_VBOF  |
  | CDB_NNC    | JJD_CCP       | PRSP_CCP | RBQ_CCP    | VBTR_VBOF_CCP |
  | CDB_NNC_CCP| JJD_CCT       | PRS_CCB  | RBQ_LM     | VBTR_VBRF  |
  | JJCC_CCP   | JJD_NNC       | PRS_CCP  | RBR_DTP    | VBTR_VBRF_CCP |
  | JJCC_JJD   | JJD_NNP       | PRS_LM   | RBS_CCP    | VBTS_CCA   |
  | JJCN_CCP   | JJN_CCA       | RBD_CCB  | RBT_CCB    | VBTS_CCP   |
  | JJCN_LM    | JJN_CCB       | RBD_CCP  | RBT_CCP    | VBTS_JJD   |
  | JJCS_CCB   | JJN_CCP       | RBD_LM   | RBT_CCT    | VBTS_LM    |
  | JJCS_CCP   | JJN_NNC       | RBF_CCP  | RBT_LM     | VBTS_VBAF  |
  | JJCS_JJC   | JJN_NNC_CCP   | RBF_JJD  | RBW_CCA    | VBTS_VBOB  |
  | JJCS_JJC_CCP| JJN_NNP       | RBF_JJD_CCP | RBW_CCB    | VBTS_VBOF  |
  | JJCS_JJD   | NNC_CCA       | RBF_LM   | RBW_CCP    | VBTS_VBOF_CCP |
  | JJCS_JJD_CCB| NNC_CCB       | RBF_RBW  | RBW_DTP    | VBTS_VBOL  |
  | JJCS_JJD_CCP| NNC_CCP       | RBF_VBTR | RBW_LM     | VBTS_VBRF  |
  | JJCS_JJD_NNC| NNC_LM        | RBF_VBW_CCP| RBW_RBI    | VBW_CCB    |
  | JJCS_JJN   | NNC_PMC       | RBI_CCA  | VBAF_CCP   | VBW_CCP    |
  | JJCS_JJN_CCP| NNP_CCA       | RBI_CCP  | VBH_CCB    | VBW_CDB    |
  | JJCS_RBF   | NNP_CCP       | RBI_LM   | VBH_CCP    | VBW_LM     |
  | JJCS_VBAF  | PRC_CCB       | RBJ_CCP  | VBN_CCP    |            |
  | JJCS_VBAF_CCP| PRC_CCP      | RBK_LM   | VBOB_CCP   |            |
  | JJCS_VBN_CCP| PRI_CCB       | RBL_CCP
 




## Bias, Risks, and Limitations

This model has not been fully tested so please use with caution.