g8a9 commited on
Commit
09c4af3
·
1 Parent(s): ced696a

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +65 -0
README.md ADDED
@@ -0,0 +1,65 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language: it
3
+ license: gpl-3.0
4
+ tags:
5
+ - text classification
6
+ - misogyny detection
7
+ - abusive language
8
+ - hate speech
9
+ - offensive language
10
+
11
+ widget:
12
+ - text: "Credo che le donne debbano essere tutelate maggiormente."
13
+ example_title: "Misogyny Detection Example 1"
14
+
15
+ ---
16
+
17
+ # Entropy-based Attention Regularization 👂
18
+
19
+ This is an Italian BERT fine-tuned with [Entropy-based Attention Regularization](https://aclanthology.org/2022.findings-acl.88/) to reduce lexical overfitting to specific words on the task of Misogyny Identification.
20
+ Use this model if you want a debiased alternative to a BERT classifier.
21
+
22
+ Please refer to the paper to know all the training details.
23
+
24
+ ## Model
25
+
26
+ This model is the fine-tuned version of the Italian [dbmdz/bert-base-italian-uncased](https://huggingface.co/dbmdz/bert-base-italian-uncased) model.
27
+ We trained a total of three versions for Italian and English.
28
+
29
+ | Model | Download |
30
+ | ------ | -------------------------|
31
+ | `bert-base-uncased-ear-misogyny` | [Link](https://huggingface.co/MilaNLProc/bert-base-uncased-ear-misogyny) |
32
+ | `bert-base-uncased-ear-mlma` | [Link]() |
33
+ | `bert-base-uncased-ear-misogyny-italian` | [Link](https://huggingface.co/MilaNLProc/bert-base-uncased-ear-misogyny-italian) |
34
+
35
+ # Authors
36
+ - [Giuseppe Attanasio](https://gattanasio.cc/)
37
+ - [Debora Nozza](http://dnozza.github.io/)
38
+ - [Dirk Hovy](https://federicobianchi.io/)
39
+ - Elena
40
+
41
+ # Citation
42
+
43
+ Please use the following BibTeX entry if you use this model in your project:
44
+
45
+ ```
46
+ @inproceedings{attanasio-etal-2022-entropy,
47
+ title = "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists",
48
+ author = "Attanasio, Giuseppe and
49
+ Nozza, Debora and
50
+ Hovy, Dirk and
51
+ Baralis, Elena",
52
+ booktitle = "Findings of the Association for Computational Linguistics: ACL 2022",
53
+ month = may,
54
+ year = "2022",
55
+ address = "Dublin, Ireland",
56
+ publisher = "Association for Computational Linguistics",
57
+ url = "https://aclanthology.org/2022.findings-acl.88",
58
+ doi = "10.18653/v1/2022.findings-acl.88",
59
+ pages = "1105--1119",
60
+ abstract = "Natural Language Processing (NLP) models risk overfitting to specific terms in the training data, thereby reducing their performance, fairness, and generalizability. E.g., neural hate speech detection models are strongly influenced by identity terms like gay, or women, resulting in false positives, severe unintended bias, and lower performance.Most mitigation techniques use lists of identity terms or samples from the target domain during training. However, this approach requires a-priori knowledge and introduces further bias if important terms are neglected.Instead, we propose a knowledge-free Entropy-based Attention Regularization (EAR) to discourage overfitting to training-specific terms. An additional objective function penalizes tokens with low self-attention entropy.We fine-tune BERT via EAR: the resulting model matches or exceeds state-of-the-art performance for hate speech classification and bias metrics on three benchmark corpora in English and Italian.EAR also reveals overfitting terms, i.e., terms most likely to induce bias, to help identify their effect on the model, task, and predictions.",
61
+ }
62
+ ```
63
+
64
+ ## License
65
+ [GNU GPLv3](https://choosealicense.com/licenses/gpl-3.0/)