Update model
README.md
CHANGED
@@ -13,9 +13,9 @@ widget:
 # POET: A French Extended Part-of-Speech Tagger

 - Corpora: [ANTILLES](https://github.com/qanastek/ANTILLES)
-- Embeddings: [
+- Embeddings: [Flair](https://aclanthology.org/C18-1139.pdf) & [CamemBERT](https://arxiv.org/abs/1911.03894)
 - Sequence Labelling: [Bi-LSTM-CRF](https://arxiv.org/abs/1011.4088)
-- Number of Epochs:
+- Number of Epochs: 50

 **People Involved**

@@ -143,76 +143,78 @@ The test corpora used for this evaluation is available on [Github](https://githu

 ```plain
 Results:
-- F-score (micro)
-- F-score (macro)
-- Accuracy
+- F-score (micro) 0.9797
+- F-score (macro) 0.9178
+- Accuracy 0.9797

 By class:
               precision    recall  f1-score   support
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-      DINTMS     0.9919    1.0000    0.9959       122
-       ADJMS     0.9020    0.9057    0.9039       244
-         NMS     0.9226    0.9336    0.9281       753
-         NFS     0.9347    0.9714    0.9527       560
-       YPFOR     0.9806    1.0000    0.9902       353
-      PINDMS     1.0000    0.9091    0.9524        44
-        NOUN     0.8400    0.5385    0.6562        39
-       PROPN     0.8605    0.8278    0.8439       395
-       DETMS     0.9972    0.9972    0.9972       362
-     PPER3MS     0.9341    0.9770    0.9551        87
-       VPPMS     0.8994    0.9682    0.9325       157
+
+        PREP     0.9966    0.9987    0.9976      1483
+       PUNCT     1.0000    1.0000    1.0000       833
+         NMS     0.9634    0.9801    0.9717       753
+         DET     0.9923    0.9984    0.9954       645
+        VERB     0.9913    0.9811    0.9862       583
+         NFS     0.9667    0.9839    0.9752       560
+         ADV     0.9940    0.9821    0.9880       504
+       PROPN     0.9541    0.8937    0.9229       395
+       DETMS     1.0000    1.0000    1.0000       362
+         AUX     0.9860    0.9915    0.9888       355
+       YPFOR     1.0000    1.0000    1.0000       353
+         NMP     0.9666    0.9475    0.9570       305
+        COCO     0.9959    1.0000    0.9980       245
+       ADJMS     0.9463    0.9385    0.9424       244
        DETFS     1.0000    1.0000    1.0000       240
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
+        CHIF     0.9648    0.9865    0.9755       222
+         NFP     0.9515    0.9849    0.9679       199
+       ADJFS     0.9657    0.9286    0.9468       182
+       VPPMS     0.9387    0.9745    0.9563       157
+       COSUB     1.0000    0.9844    0.9921       128
+      DINTMS     0.9918    0.9918    0.9918       122
+      XFAMIL     0.9298    0.9217    0.9258       115
+     PPER3MS     1.0000    1.0000    1.0000        87
+       ADJMP     0.9294    0.9634    0.9461        82
+      PDEMMS     1.0000    1.0000    1.0000        75
+       ADJFP     0.9861    0.9342    0.9595        76
+        PREL     0.9859    1.0000    0.9929        70
+      DINTFS     0.9839    1.0000    0.9919        61
+        PREF     1.0000    1.0000    1.0000        52
+     PPOBJMS     0.9565    0.9362    0.9462        47
+       PREFP     0.9778    1.0000    0.9888        44
+      PINDMS     1.0000    0.9773    0.9885        44
+       VPPFS     0.8298    0.9750    0.8966        40
+      PPER1S     1.0000    1.0000    1.0000        42
+         SYM     1.0000    0.9474    0.9730        38
+        NOUN     0.8824    0.7692    0.8219        39
+        PRON     1.0000    0.9677    0.9836        31
+      PDEMFS     1.0000    1.0000    1.0000        29
+       VPPMP     0.9286    1.0000    0.9630        26
+         ADJ     0.9524    0.9091    0.9302        22
+     PPER3MP     1.0000    1.0000    1.0000        20
+       VPPFP     1.0000    1.0000    1.0000        19
+     PPER3FS     1.0000    1.0000    1.0000        18
+      MOTINC     0.3333    0.4000    0.3636        15
+       PREFS     1.0000    1.0000    1.0000        10
+     PPOBJMP     1.0000    0.8000    0.8889        10
+     PPOBJFS     0.6250    0.8333    0.7143         6
+        INTJ     0.5000    0.6667    0.5714         6
+        PART     1.0000    1.0000    1.0000         4
+      PDEMMP     1.0000    1.0000    1.0000         3
+      PDEMFP     1.0000    1.0000    1.0000         3
      PPER3FP     1.0000    1.0000    1.0000         2
-      MOTINC     0.3571    0.3333    0.3448        15
-      PDEMMP     1.0000    0.6667    0.8000         3
-        INTJ     0.4000    0.6667    0.5000         6
-       PREFS     1.0000    0.5000    0.6667        10
-         ADJ     0.7917    0.8636    0.8261        22
-      PINDMP     0.0000    0.0000    0.0000         1
-      PINDFS     1.0000    1.0000    1.0000         1
          NUM     1.0000    0.3333    0.5000         3
       PPER2S     1.0000    1.0000    1.0000         2
-     PPOBJFP
-      PDEMFP     1.0000    0.6667    0.8000         3
-           X     0.0000    0.0000    0.0000         1
+     PPOBJFP     0.5000    0.5000    0.5000         2
       PRELMS     1.0000    1.0000    1.0000         2
+      PINDFS     0.5000    1.0000    0.6667         1
+      PINDMP     1.0000    1.0000    1.0000         1
+           X     0.0000    0.0000    0.0000         1
       PINDFP     1.0000    1.0000    1.0000         1

-
-   macro avg     0.
-weighted avg     0.
+   micro avg     0.9797    0.9797    0.9797     10019
+   macro avg     0.9228    0.9230    0.9178     10019
+weighted avg     0.9802    0.9797    0.9798     10019
+ samples avg     0.9797    0.9797    0.9797     10019
 ```

 ## BibTeX Citations
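The updated report shows a macro F-score (0.9178) well below the micro and weighted figures (0.9797 and 0.9798). The sketch below illustrates one likely reason, assuming the usual definitions: the macro average is the unweighted mean of per-class F1 scores, while the weighted average scales each class by its support. It uses only four rows copied from the new report, so its output will not match the full-report averages. The helper names `macro_f1` and `weighted_f1` are hypothetical and are not part of the model card.

```python
# Per-class (f1-score, support) pairs copied from four rows of the new report.
# This subset is illustrative only; it does not reproduce the full averages.
rows = {
    "PREP": (0.9976, 1483),
    "PUNCT": (1.0000, 833),
    "MOTINC": (0.3636, 15),
    "X": (0.0000, 1),
}


def macro_f1(rows):
    """Unweighted mean of per-class F1: every class counts equally."""
    return sum(f1 for f1, _ in rows.values()) / len(rows)


def weighted_f1(rows):
    """Support-weighted mean of per-class F1: frequent classes dominate."""
    total = sum(support for _, support in rows.values())
    return sum(f1 * support for f1, support in rows.values()) / total


if __name__ == "__main__":
    print(f"macro F1 (subset):    {macro_f1(rows):.4f}")
    print(f"weighted F1 (subset): {weighted_f1(rows):.4f}")
```

On this subset, the two rare, low-scoring tags (`MOTINC` and `X`, 16 tokens in total) pull the macro figure far below the weighted one. The full report shows a milder version of the same effect.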