ljvmiranda921 commited on
Commit
132e489
1 Parent(s): 71b12a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -1
README.md CHANGED
@@ -58,7 +58,16 @@ model-index:
58
  type: f_score
59
  value: 0.9977571291
60
  ---
61
- Parsers for UD-NewsCrawl
 
 
 
 
 
 
 
 
 
62
 
63
  | Feature | Description |
64
  | --- | --- |
 
58
  type: f_score
59
  value: 0.9977571291
60
  ---
61
+
62
+ <img src="https://cdn-avatars.huggingface.co/v1/production/uploads/634e20a0c1ce28f1de920cc4/k7SJny1M3lDa5CH_T1bp3.png" width="130" height="130" align="right" />
63
+
64
+ # UD Parser (Multilingual context-sensitive vectors + transition-based parser)
65
+
66
+ This is the spaCy pipeline trained on [UD-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl).
67
+ It uses [multi hash embeddings](https://arxiv.org/abs/2212.09255) using [floret](https://github.com/explosion/floret).
68
+ It is trained using a transition-based parser based on [Honnibal and Johnson (2015)](https://aclanthology.org/D15-1162/) and can perform dependency parsing, lemmatization, and morphological annotation.
69
+ The trainable lemmatizer is based on [Muller et al. (2015)](https://aclanthology.org/D15-1272/).
70
+ More information can be found [in this blog post](https://explosion.ai/blog/edit-tree-lemmatizer).
71
 
72
  | Feature | Description |
73
  | --- | --- |