---
license: mit
datasets:
- FpOliveira/TuPi-Portuguese-Hate-Speech-Dataset-Binary
language:
- pt
metrics:
- accuracy
- precision
- recall
- f1
pipeline_tag: text-classification
---

## Introduction


Tupi-BERT-Base is a BERT model fine-tuned for binary classification of hate speech in Portuguese. It is derived from [BERTimbau base](https://huggingface.co/neuralmind/bert-base-portuguese-cased), and the TuPi model family is a dedicated solution for addressing hate speech concerns.
For more details or specific inquiries, please refer to the [BERTimbau repository](https://github.com/neuralmind-ai/portuguese-bert/).
The performance of language models can vary notably when the training and test data come from different domains. To build a Portuguese language model specialized for hate speech classification, the original BERTimbau model was fine-tuned for a single "PreTraining" epoch on the TuPi Hate Speech Dataset, which was collected from diverse social networks.

## Available models

| Model                                    | Arch.      | #Layers | #Params |
| ---------------------------------------- | ---------- | ------- | ------- |
| `FpOliveira/tupi-bert-base-portuguese-cased`  | BERT-Base  | 12      | 110M    |
| `FpOliveira/tupi-bert-large-portuguese-cased` | BERT-Large | 24      | 335M    |
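
The checkpoints listed above can probably be loaded through the `transformers` text-classification pipeline, as the card's `pipeline_tag` suggests. The following is a minimal sketch rather than an official usage recipe: it assumes the published checkpoint includes a classification head, and the helper names `load_classifier` and `classify` are hypothetical, introduced only for illustration. The label names returned by the model are not documented here, so the sketch passes them through unchanged.

```python
from typing import Callable, Dict, List

# Model ID taken from the table above.
MODEL_ID = "FpOliveira/tupi-bert-base-portuguese-cased"


def load_classifier(model_id: str = MODEL_ID):
    """Build a text-classification pipeline (hypothetical helper).

    Assumes the checkpoint ships with a sequence-classification head.
    Weights are downloaded from the Hugging Face Hub on first use.
    """
    # Imported lazily so this module can be loaded without transformers installed.
    from transformers import pipeline

    return pipeline("text-classification", model=model_id)


def classify(texts: List[str], classifier: Callable) -> List[Dict]:
    """Pair each input text with the top label and score from the classifier."""
    results = classifier(texts)
    return [
        {"text": text, "label": result["label"], "score": result["score"]}
        for text, result in zip(texts, results)
    ]


# Example usage (requires network access and the transformers package):
# clf = load_classifier()
# for row in classify(["Bom dia, tudo bem?"], clf):
#     print(row)
```

Passing the classifier in as an argument keeps `classify` independent of how the model is loaded, so the same function works with the base or the large checkpoint.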