Fine-grained POS Tagging of German Tweets
This Flair model was trained on the German Tweets dataset that is presented in the Fine-grained POS Tagging of German Tweets paper from Ines Rehbein.
It achieves an accuracy of 92.88% on the development set and an accuracy of 93.16% on the final test dataset.
Training
All training code is released in this repository.
The model architecture uses the training strategy as proposed in the original Flair paper: German FastText embeddings and German Flair Embeddings are stacked and passed into a BiLSTM-CRF sequence labeler, achieving robost SOTA results on PoS Tagging of German Tweets.
The full training log can be found here.
Demo: How to use in Flair
from flair.data import Sentence
from flair.models import SequenceTagger
model = SequenceTagger.load('flair/de-pos-fine-grained')
sent = Sentence("@Sneeekas Ich nicht \o/", use_tokenizer=False)
model.predict(sent)
print(sent)
This yields the following output:
Sentence[4]: "@Sneeekas Ich nicht \o/" → ["@Sneeekas"/ADDRESS, "Ich"/PPER, "nicht"/PTKNEG, "\o/"/EMO]
- Downloads last month
- 6
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API:
The model has no pipeline_tag.