File size: 1,292 Bytes
c6aa3f3
 
 
 
 
 
 
 
 
 
 
 
fd5fcb6
c6aa3f3
 
91c05c9
 
 
 
 
 
 
 
 
c6aa3f3
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
---
pipeline_tag: token-classification
tags:
- named-entity-recognition
- sequence-tagger-model
widget:
- text: Мене звуть Амадей Вольфганг, я живу в Берліні
inference:
  parameters:
    aggregation_strategy: simple
    grouped_entities: true
language:
- uk
---

xlm-roberta model trained on [ukrainian ner](https://github.com/lang-uk/flair-ner) dataset from flair

| Test metric             | Results                   |
|-------------------------|---------------------------|
| test_f1_mac_ukr_ner     | 0.9900672435760498        |
| test_loss_ukr_ner       | 0.054602641612291336      |
| test_prec_mac_ukr_ner   | 0.9386032819747925        |
| test_rec_mac_ukr_ner    | 0.9383019208908081        |



```python
from transformers import AutoTokenizer, AutoModelForTokenClassification
from transformers import pipeline

tokenizer = AutoTokenizer.from_pretrained("EvanD/xlm-roberta-base-ukrainian-ner-ukrner")
ner_model = AutoModelForTokenClassification.from_pretrained("EvanD/xlm-roberta-base-ukrainian-ner-ukrner")

nlp = pipeline("ner", model=ner_model, tokenizer=tokenizer, aggregation_strategy="simple")
example = "Мене звуть Амадей Вольфганг, я живу в Берліні"

ner_results = nlp(example)
print(ner_results)
```