|
--- |
|
license: mit |
|
language: |
|
- multilingual |
|
library_name: transformers |
|
pipeline_tag: text-classification |
|
widget: |
|
- text: "You wont believe what happened to me today" |
|
- text: "You wont believe what happened to me today!" |
|
- text: "You wont believe what happened to me today..." |
|
- text: "You wont believe what happened to me today <3" |
|
- text: "You wont believe what happened to me today :)" |
|
- text: "You wont believe what happened to me today :(" |
|
--- |
|
This is an emotion classification model based on fine-tuning of a Bernice model (which is a multilingual pre-trained model trained on multilingual Twitter data) on self-labeled emotion dataset (Lykousas et al., 2019) in English that corresponds to Anger, Fear, Sadness, Joy, and Affection. |
|
See the paper, [LEIA: Linguistic Embeddings for the Identification of Affect](https://arxiv.org/abs/2304.10973) for further details. |
|
|
|
## Evaluation |
|
We evaluated LEIA-multilingual on Vent posts with self-annotated emotion labels identified as non-English using an ensemble of language identefication tools. |
|
The table below shows the macro-F1 scores across emotion categories and languages: |
|
|
|
|
|
|
|
|Language|Macro-F1| |
|
|:---:|:---:| |
|
|ar |44.18[43.07,45.29]| |
|
|da |65.44[60.96,69.83] | |
|
|de |60.47[57.58,63.38] | |
|
|es |61.67[60.79,62.55] | |
|
|fi |45.1[40.96,49.14] | |
|
|fr |65.78[63.19,68.36] | |
|
|it |63.37[59.67,67.1] | |
|
|pt |57.27[55.15,59.4] | |
|
|tl |58.37[55.51,61.23] | |
|
|tr |45.42[41.17,49.79]| |
|
|
|
|
|
|
|
## Citation |
|
Please cite the following paper if you find the model useful for your work: |
|
```bibtex |
|
@article{aroyehun2023leia, |
|
title={LEIA: Linguistic Embeddings for the Identification of Affect}, |
|
author={Aroyehun, Segun Taofeek and Malik, Lukas and Metzler, Hannah and Haimerl, Nikolas and Di Natale, Anna and Garcia, David}, |
|
journal={EPJ Data Science}, |
|
volume={12}, |
|
year={2023}, |
|
publisher={Springer} |
|
} |
|
``` |