---
license: mit
datasets:
- FpOliveira/TuPi-Portuguese-Hate-Speech-Dataset-Binary
language:
- pt
metrics:
- accuracy
- precision
- recall
- f1
pipeline_tag: text-classification
---
## Introduction
Tupi-Bert-Base is a fine-tuned BERT model based on [BERTimbau](https://huggingface.co/neuralmind/bert-base-portuguese-cased) base. For further information or requests, please see the [BERTimbau repository](https://github.com/neuralmind-ai/portuguese-bert/).
The performance of language models can change drastically when there is a domain shift between training and test data. To create a Portuguese language model adapted to the legal domain, the original BERTimbau model went through a fine-tuning stage consisting of one "pre-training" epoch over 30,000 Portuguese legal documents available online.
## Available models
| Model                                         | Arch.      | #Layers | #Params |
| --------------------------------------------- | ---------- | ------- | ------- |
| `FpOliveira/tupi-bert-base-portuguese-cased`  | BERT-Base  | 12      | 110M    |
| `FpOliveira/tupi-bert-large-portuguese-cased` | BERT-Large | 24      | 335M    |
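
A minimal sketch of how one of the checkpoints listed above might be loaded, assuming the `transformers` library is installed and that the checkpoints ship a sequence-classification head (suggested by the `text-classification` pipeline tag, but not confirmed here). The helper name `build_classifier` and the sample sentence are illustrative only, and the label names returned depend on the checkpoint's configuration.

```python
# Model IDs copied from the "Available models" table.
MODEL_IDS = {
    "base": "FpOliveira/tupi-bert-base-portuguese-cased",
    "large": "FpOliveira/tupi-bert-large-portuguese-cased",
}


def build_classifier(size="base"):
    """Return a text-classification pipeline for the chosen checkpoint.

    Hypothetical helper. Assumption: the checkpoint includes a
    sequence-classification head that `pipeline` can load.
    """
    if size not in MODEL_IDS:
        raise ValueError(f"unknown size {size!r}; expected one of {sorted(MODEL_IDS)}")
    # Imported lazily so the model IDs can be inspected without transformers installed.
    from transformers import pipeline

    # Downloads the weights from the Hugging Face Hub on first use.
    return pipeline("text-classification", model=MODEL_IDS[size])


if __name__ == "__main__":
    classifier = build_classifier("base")
    # Each result is a dict with "label" and "score" keys.
    print(classifier("Bom dia, tudo bem?"))
```

The large checkpoint can be selected with `build_classifier("large")`; it has roughly three times as many parameters, so it needs more memory and is slower at inference.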