antypasd commited on
Commit
f6a1aff
·
1 Parent(s): 45c42b4

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +50 -0
README.md ADDED
@@ -0,0 +1,50 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - cardiffnlp/super_tweeteval
5
+ language:
6
+ - en
7
+ pipeline_tag: text-classification
8
+ ---
9
+ # cardiffnlp/twitter-roberta-large-latest-tweet-hate
10
+
11
+
12
+ This is a RoBERTa-large model trained on 154M tweets until the end of December 2022 and finetuned for hate speech detection (multiclass classification) on the _TweetHate_ dataset of [SuperTweetEval](https://huggingface.co/datasets/cardiffnlp/super_tweeteval).
13
+ The original Twitter-based RoBERTa model can be found [here](https://huggingface.co/cardiffnlp/twitter-roberta-large-2022-154m).
14
+
15
+ # Labels
16
+ <code>
17
+ "id2label": {
18
+ "0": "hate_gender",
19
+ "1": "hate_race",
20
+ "2": "hate_sexuality",
21
+ "3": "hate_religion",
22
+ "4": "hate_origin",
23
+ "5": "hate_disability",
24
+ "6": "hate_age",
25
+ "7": "not_hate"
26
+ }
27
+ </code>
28
+
29
+ ## Example
30
+ ```python
31
+ from transformers import pipeline
32
+ text = 'Eid Mubarak Everyone!!! ❤ May Allah unite all Muslims, show us the right path, and bless us with good health.❣'
33
+
34
+ pipe = pipeline('text-classification', model="cardiffnlp/twitter-roberta-large-latest-tweet-hate")
35
+ pipe(text)
36
+ >> [{'label': 'not_hate', 'score': 0.9997966885566711}]
37
+ ```
38
+
39
+ ## Citation Information
40
+
41
+ Please cite the [reference paper](https://arxiv.org/abs/2310.14757) if you use this model.
42
+
43
+ ```bibtex
44
+ @inproceedings{antypas2023supertweeteval,
45
+ title={SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research},
46
+ author={Dimosthenis Antypas and Asahi Ushio and Francesco Barbieri and Leonardo Neves and Kiamehr Rezaee and Luis Espinosa-Anke and Jiaxin Pei and Jose Camacho-Collados},
47
+ booktitle={Findings of the Association for Computational Linguistics: EMNLP 2023},
48
+ year={2023}
49
+ }
50
+ ```