UD-Filipino/tl_hash_transition

Token Classification


ljvmiranda921 committed 16 days ago

Commit 132e489 (1 parent: 71b12a7)

Update README.md

Files changed (1)

README.md +10 -1

@@ -58,7 +58,16 @@ model-index:
       type: f_score
       value: 0.9977571291
 ---
-Parsers for UD-NewsCrawl
+<img src="https://cdn-avatars.huggingface.co/v1/production/uploads/634e20a0c1ce28f1de920cc4/k7SJny1M3lDa5CH_T1bp3.png" width="130" height="130" align="right" />
+# UD Parser (Multilingual context-sensitive vectors + transition-based parser)
+This is the spaCy pipeline trained on [UD-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl).
+It uses [multi hash embeddings](https://arxiv.org/abs/2212.09255) using [floret](https://github.com/explosion/floret).
+It is trained using a transition-based parser based on [Honnibal and Johnson (2015)](https://aclanthology.org/D15-1162/) and can perform dependency parsing, lemmatization, and morphological annotation.
+The trainable lemmatizer is based on [Muller et al. (2015)](https://aclanthology.org/D15-1272/).
+More information can be found [in this blog post](https://explosion.ai/blog/edit-tree-lemmatizer).
 | Feature | Description |
 | --- | --- |
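
The updated README mentions multi hash embeddings. As a rough illustration of the idea (not spaCy's or floret's actual implementation), the sketch below hashes each lexical attribute of a token several times into a small table, sums the selected rows, and concatenates the per-attribute results. All names here (`hashed_rows`, `HashEmbedding`, `token_features`, `embed_token`), the table sizes, the use of SHA-256, and the simplified word shape are assumptions made for this example; the learned mixing layer that normally follows the concatenation and floret's subword n-grams are left out.

```python
import hashlib
import random
from typing import Dict, List


def hashed_rows(key: str, n_rows: int, n_hashes: int = 4) -> List[int]:
    """Map a string to n_hashes row indices using differently seeded hashes."""
    rows = []
    for seed in range(n_hashes):
        digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
        rows.append(int.from_bytes(digest[:8], "little") % n_rows)
    return rows


class HashEmbedding:
    """A small table whose rows are shared by all keys that hash to them."""

    def __init__(self, n_rows: int = 1000, width: int = 8,
                 n_hashes: int = 4, seed: int = 0) -> None:
        rng = random.Random(seed)  # fixed seed so the sketch is reproducible
        self.n_rows = n_rows
        self.width = width
        self.n_hashes = n_hashes
        self.table = [[rng.uniform(-0.1, 0.1) for _ in range(width)]
                      for _ in range(n_rows)]

    def embed(self, key: str) -> List[float]:
        """Sum the rows selected by each hash of the key."""
        vec = [0.0] * self.width
        for row in hashed_rows(key, self.n_rows, self.n_hashes):
            for i, value in enumerate(self.table[row]):
                vec[i] += value
        return vec


def token_features(text: str) -> Dict[str, str]:
    """Extract simplified lexical attributes (hypothetical, for illustration)."""
    shape = "".join(
        "X" if c.isupper() else "x" if c.islower() else "d" if c.isdigit() else c
        for c in text
    )
    return {
        "norm": text.lower(),
        "prefix": text[:1],
        "suffix": text[-3:],
        "shape": shape,
    }


# One table per attribute; the sizes are arbitrary choices for this sketch.
TABLES = {
    name: HashEmbedding(n_rows=n_rows, seed=i)
    for i, (name, n_rows) in enumerate(
        [("norm", 5000), ("prefix", 1000), ("suffix", 2500), ("shape", 2500)]
    )
}


def embed_token(text: str) -> List[float]:
    """Concatenate the hash embeddings of every attribute of the token."""
    feats = token_features(text)
    vec: List[float] = []
    for name, table in TABLES.items():
        vec.extend(table.embed(feats[name]))
    return vec
```

Because several hashes pick the rows for each key, two distinct strings are unlikely to collide on all of them, which is what lets the tables stay small. In this sketch, `embed_token("Maynila")` and `embed_token("maynila")` share their `norm` and `suffix` segments, while the `prefix` and `shape` segments are built from different keys.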