LEIA
/

This is an emotion classification model based on fine-tuning of a Bernice model, which is a pre-trained model trained on multilingual Twitter data. The fine-tuning dataset is a subset of the self-labeled emotion dataset (Lykousas et al., 2019) in English that corresponds to Anger, Fear, Sadness, Joy, and Affection. See the paper, LEIA: Linguistic Embeddings for the Identification of Affect for further details.

Evaluation

We evaluated LEIA-multilingual on posts with self-annotated emotion labels identified as non-English using an ensemble of language identification tools. The table below shows the macro-F1 scores aggregated across emotion categories for each language:

Language Macro-F1
ar 44.18[43.07,45.29]
da 65.44[60.96,69.83]
de 60.47[57.58,63.38]
es 61.67[60.79,62.55]
fi 45.1[40.96,49.14]
fr 65.78[63.19,68.36]
it 63.37[59.67,67.1]
pt 57.27[55.15,59.4]
tl 58.37[55.51,61.23]
tr 45.42[41.17,49.79]

Citation

Please cite the following paper if you find the model useful for your work:

@article{aroyehun2023leia,
  title={LEIA: Linguistic Embeddings for the Identification of Affect},
  author={Aroyehun, Segun Taofeek and Malik, Lukas and Metzler, Hannah and Haimerl, Nikolas and Di Natale, Anna and Garcia, David},
  journal={EPJ Data Science},
  volume={12},
  year={2023},
  publisher={Springer}
}
Downloads last month
31
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.