README.md · MilaNLProc/njh-classifier at main

metadata

language:
  - en
base_model:
  - vinai/bertweet-large

Not Just Hate (NJH) - Uploaded Version The uploaded balanced model for multi-label harmful speech classification. Labels: Profanity, Insults, Outrage, Character Assassination, Discrimination, Hostility, Incivility, and Intolerance.

Bianchi, F., Hills, S., Rossini, P., Hovy, D., Tromble, R., & Tintarev, N. (2022). "It's not just hate": a multi-dimensional perspective on detecting harmful speech online. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.