FpOliveira
/

tupi-bert-base-portuguese-cased

Text Classification

Inference Endpoints

Model card Files Files and versions Community

tupi-bert-base-portuguese-cased / README.md

FpOliveira's picture

Update README.md

59baefe about 1 year ago

|

1.15 kB

	---
	license: mit
	datasets:
	- FpOliveira/TuPi-Portuguese-Hate-Speech-Dataset-Binary
	language:
	- pt
	metrics:
	- accuracy
	- precision
	- recall
	- f1
	pipeline_tag: text-classification
	---

	## Introduction

	Tupi-Bert-Base is a fine-tuned BERT model based on [BERTimbau](https://huggingface.co/neuralmind/bert-base-portuguese-cased) base.For further information or requests, please go to [BERTimbau repository](https://github.com/neuralmind-ai/portuguese-bert/)."

	The performance of Language Models can change drastically when there is a domain shift between training and test data. In order create a Portuguese Language Model adapted to a Legal domain, the original BERTimbau model was submitted to a fine-tuning stage where it was performed 1 "PreTraining" epoch over 30 000 legal Portuguese Legal documents available online.


	## Available models

	\| Model \| Arch. \| #Layers \| #Params \|
	\| ---------------------------------------- \| ---------- \| ------- \| ------- \|
	\| `FpOliveira/tupi-bert-base-portuguese-cased` \| BERT-Base \|12 \|110M\|
	\| `FpOliveira/tupi-bert-large-portuguese-cased` \| BERT-Large \| 24 \| 335M \|