---
license: mit
datasets:
- FpOliveira/TuPi-Portuguese-Hate-Speech-Dataset-Binary
language:
- pt
metrics:
- accuracy
- precision
- recall
- f1
pipeline_tag: text-classification
---
## Introduction

Tupi-Bert-Base is a BERT model fine-tuned from [BERTimbau](https://huggingface.co/neuralmind/bert-base-portuguese-cased) Base. For further information or requests, please see the [BERTimbau repository](https://github.com/neuralmind-ai/portuguese-bert/).

The performance of language models can change drastically when there is a domain shift between training and test data. To create a Portuguese language model adapted to the legal domain, the original BERTimbau model underwent a fine-tuning stage consisting of one "pre-training" epoch over 30,000 Portuguese legal documents available online.

## Available models

| Model | Arch. | #Layers | #Params |
| ---------------------------------------- | ---------- | ------- | ------- |
| `FpOliveira/tupi-bert-base-portuguese-cased` | BERT-Base | 12 | 110M |
| `FpOliveira/tupi-bert-large-portuguese-cased` | BERT-Large | 24 | 335M |
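
To illustrate how the model IDs in the table above might be used, here is a minimal sketch built on the `transformers` text-classification pipeline. The helper names `MODEL_IDS`, `resolve_model_id`, and `classify` are hypothetical and introduced only for this example. The sketch also assumes that both checkpoints are published on the Hub with a sequence-classification head, which this card implies through its `pipeline_tag` but does not state explicitly.

```python
# Hypothetical helpers for illustration; only the model IDs come from the table.
MODEL_IDS = {
    "base": "FpOliveira/tupi-bert-base-portuguese-cased",
    "large": "FpOliveira/tupi-bert-large-portuguese-cased",
}


def resolve_model_id(size: str = "base") -> str:
    """Map a size keyword ("base" or "large") to its Hub model ID."""
    key = size.lower()
    if key not in MODEL_IDS:
        raise ValueError(f"unknown size {size!r}; expected one of {sorted(MODEL_IDS)}")
    return MODEL_IDS[key]


def classify(texts, size: str = "base"):
    """Classify a sequence of Portuguese texts.

    Assumption: the checkpoint ships a classification head. The weights are
    downloaded from the Hub the first time this function is called.
    """
    # Imported lazily so resolve_model_id works without transformers installed.
    from transformers import pipeline

    classifier = pipeline("text-classification", model=resolve_model_id(size))
    # Each result is a dict with "label" and "score" keys. The label names
    # depend on the checkpoint's config and are not documented in this card.
    return classifier(list(texts))
```

Calling `classify(["Bom dia, tudo bem?"])` should return one dict per input text. Check the `id2label` mapping in the checkpoint's config before interpreting the labels.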