---
language:
  - en
base_model:
  - vinai/bertweet-large
---

Not Just Hate (NJH) - Uploaded Version

This is the uploaded balanced model for multi-label harmful speech classification. A single text can receive any combination of the following eight labels: Profanity, Insults, Outrage, Character Assassination, Discrimination, Hostility, Incivility, and Intolerance.
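
Because the task is multi-label, each label is typically scored independently rather than competing in a softmax. The sketch below shows one common way to turn eight raw scores (logits) into a set of predicted labels. It is illustrative only: the label order, the 0.5 threshold, the helper names `sigmoid` and `decode_multilabel`, and the example logits are all assumptions, not values taken from this model. In practice the logits would come from the model's classification head, and the label order should be read from the model's configuration.

```python
import math

# Label names as listed in this card. The index order is an ASSUMPTION;
# check the model's own label mapping before relying on it.
LABELS = [
    "Profanity", "Insults", "Outrage", "Character Assassination",
    "Discrimination", "Hostility", "Incivility", "Intolerance",
]


def sigmoid(x: float) -> float:
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def decode_multilabel(logits, threshold=0.5):
    """Map one logit per label to {label: probability} for labels at or above threshold.

    The 0.5 threshold is a generic default (an assumption), not a tuned value.
    """
    if len(logits) != len(LABELS):
        raise ValueError(f"expected {len(LABELS)} logits, got {len(logits)}")
    probs = [sigmoid(z) for z in logits]
    return {label: p for label, p in zip(LABELS, probs) if p >= threshold}


# Made-up logits for illustration only.
example_logits = [2.0, 1.0, -3.0, -2.0, -4.0, 0.5, 1.5, -1.0]
predicted = decode_multilabel(example_logits)
print(sorted(predicted))
```

Each label is thresholded on its own, so a text can receive several labels at once or none at all.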

Bianchi, F., Hills, S., Rossini, P., Hovy, D., Tromble, R., & Tintarev, N. (2022). "It's not just hate": a multi-dimensional perspective on detecting harmful speech online. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.