Update README.md
Browse files
README.md
CHANGED
@@ -26,6 +26,123 @@ This is a BERT Tagalog Base Uncased Part of Speech tagger fine-tuned model of [J
|
|
26 |
- **Dataset:** Sagum et. al.'s annotated Tagalog Corpora based on MGNN Tagset convention. This model was trained in 800 sentences and evaluated with 200 sentences.
|
27 |
- **Finetuned from model GKLMIP/bert-tagalog-base-uncased:** [https://huggingface.co/GKLMIP/bert-tagalog-base-uncased]
|
28 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
29 |
|
30 |
## Bias, Risks, and Limitations
|
31 |
|
|
|
26 |
- **Dataset:** Sagum et. al.'s annotated Tagalog Corpora based on MGNN Tagset convention. This model was trained in 800 sentences and evaluated with 200 sentences.
|
27 |
- **Finetuned from model GKLMIP/bert-tagalog-base-uncased:** [https://huggingface.co/GKLMIP/bert-tagalog-base-uncased]
|
28 |
|
29 |
+
### Tags
|
30 |
+
|
31 |
+
| Part of Speech | Tags |
|
32 |
+
|-----------------------------------------------|------|
|
33 |
+
| **Noun** | NNC |
|
34 |
+
| Common Noun | NNC |
|
35 |
+
| Proper Noun | NNP |
|
36 |
+
| Proper Noun Abbreviation | NNPA |
|
37 |
+
| Common Noun Abbreviation | NNCA |
|
38 |
+
| **Pronoun** | PR |
|
39 |
+
| as Subject (Palagyo)/Personal Pronouns Singular | PRS |
|
40 |
+
| Personal Pronouns | PRP |
|
41 |
+
| Possessive Subject (Paari) | PRSP |
|
42 |
+
| Pointing to an Object Demonstrative/(Paturol/Pamatlig) | PRO |
|
43 |
+
| Question/Interrogative (Pananong)/Singular | PRQ |
|
44 |
+
| Question/Interrogative Plural | PRQP |
|
45 |
+
| Location (Panlunan) | PRL |
|
46 |
+
| Comparison (Panulad) | PRC |
|
47 |
+
| Found (Pahimaton) | PRF |
|
48 |
+
| Indefinite | PRI |
|
49 |
+
| **Determiner** | DT |
|
50 |
+
| Determiner (Pantukoy) for Common Noun Plural | DTC |
|
51 |
+
| Determiner (Pantukoy) for Proper Noun | DTP |
|
52 |
+
| Determiner (Pantukoy) for Proper Noun Plural | DTPP |
|
53 |
+
| Lexical Marker | LM |
|
54 |
+
| Conjunctions (Pang-ugnay) | CC, CCT, CCR, CCB, CCA |
|
55 |
+
| Ligatures (Pang-angkop) | CCP |
|
56 |
+
| Preposition (Pang-ukol) | CCU |
|
57 |
+
| **Verb (Pandiwa)** | VB |
|
58 |
+
| Neutral/Infinitive | VBW |
|
59 |
+
| Auxiliary, Modal/Pseudo-verbs | VBS |
|
60 |
+
| Existential | VBH |
|
61 |
+
| Non-existential | VBN |
|
62 |
+
| Time Past (Perfective) | VBTS |
|
63 |
+
| Time Present (Imperfective) | VBTR |
|
64 |
+
| Time Future (Contemplative) | VBTF |
|
65 |
+
| Recent past | VBTP |
|
66 |
+
| Actor Focus | VBAF |
|
67 |
+
| Object/Goal Focus | VBOF |
|
68 |
+
| Benefactive Focus | VBOB |
|
69 |
+
| Locative Focus | VBOL |
|
70 |
+
| Instrumental Focus | VBOI |
|
71 |
+
| Referential/Measurement Focus | VBRF |
|
72 |
+
| **Adjective (Pang-uri)** | JJ |
|
73 |
+
| Describing (Panlarawan) | JJD |
|
74 |
+
| Used for Comparison (same level) (Pahambing Magkatulad) | JJC |
|
75 |
+
| Comparison Comparative (more) (Palamang) | JJCC |
|
76 |
+
| Comparison Superlative (most) (Pasukdol) | JJCS |
|
77 |
+
| Comparison Negation (not quite) (Di-Magkatulad) | JJCN |
|
78 |
+
| Describing Number (Pamilang) | JJN |
|
79 |
+
| **Adverb (Pang-Abay)** | RB |
|
80 |
+
| Describing “How” (Pamaraan) | RBD |
|
81 |
+
| Number (Panggaano/Panukat) | RBN |
|
82 |
+
| Conditional (Kondisyunal) | RBK |
|
83 |
+
| Causative (Pananhi) | RBP |
|
84 |
+
| Benefactive (Benepaktibo) | RBB |
|
85 |
+
| Referential (Pangkaukulan) | RBR |
|
86 |
+
| Question (Pananong) | RBQ |
|
87 |
+
| Agree (Panang-ayon) | RBT |
|
88 |
+
| Disagree (Pananggi) | RBF |
|
89 |
+
| Frequency (Pamanahon) | RBW |
|
90 |
+
| Possibility (Pang-agam) | RBM |
|
91 |
+
| Place (Panlunan) | RBL |
|
92 |
+
| Enclitics (Paningit) | RBI |
|
93 |
+
| Interjections (Sambitla) | RBJ |
|
94 |
+
| Social Formula (Pormularyong Panlipunan) | RBS |
|
95 |
+
|**Cardinal Number (Bilang)** | CD |
|
96 |
+
| Digit, Rank, Count | CDB |
|
97 |
+
| **Topicless (Walang Paksa)** | TS |
|
98 |
+
| Foreign Words | FW |
|
99 |
+
| **Punctuation (Pananda)** | PM |
|
100 |
+
| Period | PMP |
|
101 |
+
| Exclamation Point | PME |
|
102 |
+
| Question Mark | PMQ |
|
103 |
+
| Comma | PMC |
|
104 |
+
| Semi-colon | PMSC |
|
105 |
+
| **Symbols** | PMS |
|
106 |
+
|
107 |
+
|
108 |
+
| Compound Tags | | | | |
|
109 |
+
|------------|---------------|----------|------------|------------|
|
110 |
+
| CCB_CCP | JJCS_VBRF_CCP | PRI_CCT | RBL_JJD | VBS_CCP |
|
111 |
+
| CCR_CCA | JJCS_VBTR | PRI_LM | RBL_JJD_CCP| VBTF_CCP |
|
112 |
+
| CCR_CCB | JJCS_VBTR_VBOF| PRL_CCP | RBL_NNC | VBTF_JJD |
|
113 |
+
| CCR_CCP | JJCS_VBTR_VBRF| PRL_LM | RBL_NNP | VBTF_VBAF |
|
114 |
+
| CCR_LM | JJCS_VBTS | PRO_CCB | RBL_NNPA | VBTF_VBOB |
|
115 |
+
| CCT_CCA | JJCS_VBW | PRO_CCP | RBL_NNP_NNP| VBTF_VBOF |
|
116 |
+
| CCT_CCP | JJC_CCB | PRO_LM | RBL_PRL | VBTF_VBOF_CCP |
|
117 |
+
| CCT_LM | JJC_CCP | PRP_CCB | RBM_CCP | VBTR_CCP |
|
118 |
+
| CCU_DTP | JJC_JJD | PRP_CCP | RBM_LM | VBTR_VBAF |
|
119 |
+
| CDB_CCA | JJC_PRL | PRP_LM | RBN_CCP | VBTR_VBAF_CCP |
|
120 |
+
| CDB_CCP | JJD_CCA | PRQ_CCP | RBP_CCP | VBTR_VBOB |
|
121 |
+
| CDB_LM | JJD_CCB | PRQ_LM | RBQ_CCB | VBTR_VBOF |
|
122 |
+
| CDB_NNC | JJD_CCP | PRSP_CCP | RBQ_CCP | VBTR_VBOF_CCP |
|
123 |
+
| CDB_NNC_CCP| JJD_CCT | PRS_CCB | RBQ_LM | VBTR_VBRF |
|
124 |
+
| JJCC_CCP | JJD_NNC | PRS_CCP | RBR_DTP | VBTR_VBRF_CCP |
|
125 |
+
| JJCC_JJD | JJD_NNP | PRS_LM | RBS_CCP | VBTS_CCA |
|
126 |
+
| JJCN_CCP | JJN_CCA | RBD_CCB | RBT_CCB | VBTS_CCP |
|
127 |
+
| JJCN_LM | JJN_CCB | RBD_CCP | RBT_CCP | VBTS_JJD |
|
128 |
+
| JJCS_CCB | JJN_CCP | RBD_LM | RBT_CCT | VBTS_LM |
|
129 |
+
| JJCS_CCP | JJN_NNC | RBF_CCP | RBT_LM | VBTS_VBAF |
|
130 |
+
| JJCS_JJC | JJN_NNC_CCP | RBF_JJD | RBW_CCA | VBTS_VBOB |
|
131 |
+
| JJCS_JJC_CCP| JJN_NNP | RBF_JJD_CCP | RBW_CCB | VBTS_VBOF |
|
132 |
+
| JJCS_JJD | NNC_CCA | RBF_LM | RBW_CCP | VBTS_VBOF_CCP |
|
133 |
+
| JJCS_JJD_CCB| NNC_CCB | RBF_RBW | RBW_DTP | VBTS_VBOL |
|
134 |
+
| JJCS_JJD_CCP| NNC_CCP | RBF_VBTR | RBW_LM | VBTS_VBRF |
|
135 |
+
| JJCS_JJD_NNC| NNC_LM | RBF_VBW_CCP| RBW_RBI | VBW_CCB |
|
136 |
+
| JJCS_JJN | NNC_PMC | RBI_CCA | VBAF_CCP | VBW_CCP |
|
137 |
+
| JJCS_JJN_CCP| NNP_CCA | RBI_CCP | VBH_CCB | VBW_CDB |
|
138 |
+
| JJCS_RBF | NNP_CCP | RBI_LM | VBH_CCP | VBW_LM |
|
139 |
+
| JJCS_VBAF | PRC_CCB | RBJ_CCP | VBN_CCP | |
|
140 |
+
| JJCS_VBAF_CCP| PRC_CCP | RBK_LM | VBOB_CCP | |
|
141 |
+
| JJCS_VBN_CCP| PRI_CCB | RBL_CCP
|
142 |
+
|
143 |
+
|
144 |
+
|
145 |
+
|
146 |
|
147 |
## Bias, Risks, and Limitations
|
148 |
|