Commit 132e489 by ljvmiranda921
1 parent: 71b12a7
Update README.md
README.md
CHANGED
@@ -58,7 +58,16 @@ model-index:
 type: f_score
 value: 0.9977571291
 ---
-
+
+<img src="https://cdn-avatars.huggingface.co/v1/production/uploads/634e20a0c1ce28f1de920cc4/k7SJny1M3lDa5CH_T1bp3.png" width="130" height="130" align="right" />
+
+# UD Parser (Multilingual context-sensitive vectors + transition-based parser)
+
+This is the spaCy pipeline trained on [UD-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl).
+It uses [multi hash embeddings](https://arxiv.org/abs/2212.09255) using [floret](https://github.com/explosion/floret).
+It is trained using a transition-based parser based on [Honnibal and Johnson (2015)](https://aclanthology.org/D15-1162/) and can perform dependency parsing, lemmatization, and morphological annotation.
+The trainable lemmatizer is based on [Muller et al. (2015)](https://aclanthology.org/D15-1272/).
+More information can be found [in this blog post](https://explosion.ai/blog/edit-tree-lemmatizer).
 
 | Feature | Description |
 | --- | --- |