Update README.md
README.md CHANGED

```diff
@@ -21,9 +21,7 @@ widget:
 
 # HateBERTimbau
 
-
-
-This modelcard aims to be a base template for new models. It has been generated using [this raw template](https://github.com/huggingface/huggingface_hub/blob/main/src/huggingface_hub/templates/modelcard_template.md?plain=1).
+HateBERTimbau is a transformer-based encoder model for identifying hate speech in Portuguese social media text. It is a fine-tuned version of the [BERTimbau](https://huggingface.co/neuralmind/bert-large-portuguese-cased) model, retrained on a dataset of 229,103 tweets specifically focused on potential hate speech.
 
 ## Model Details
 
@@ -54,53 +52,25 @@ This modelcard aims to be a base template for new models. It has been generated
 
 ### Training Data
 
-
-
-[More Information Needed]
+229,103 tweets associated with offensive content were used for training.
 
 ### Training Procedure
 
 <!-- This relates heavily to the Technical Specifications. Content here should link to that section when it is relevant to the training procedure. -->
 
-#### Preprocessing [optional]
-
-[More Information Needed]
-
-
 #### Training Hyperparameters
 
-
-
+- Batch Size: 4 samples
+- Epochs: 100
+- Learning Rate: 5e-5 with Adam optimizer
+- Maximum Sequence Length: 512 sentence pieces
 
 ## Evaluation
 
-
-
-### Testing Data, Factors & Metrics
-
-#### Testing Data
-
-<!-- This should link to a Dataset Card if possible. -->
-
-[More Information Needed]
-
-#### Factors
-
-<!-- These are the things the evaluation is disaggregating by, e.g., subpopulations or domains. -->
-
-[More Information Needed]
-
-#### Metrics
-
-<!-- These are the evaluation metrics being used, ideally with a description of why. -->
-
-[More Information Needed]
+### Testing Data
 
 ### Results
 
-[More Information Needed]
-
-#### Summary
 
 
 ## BibTeX Citation
```
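Since the updated card describes a sequence-classification fine-tune, loading it with the 🤗 Transformers `pipeline` API should work roughly as sketched below. This is a minimal sketch, not taken from the card: the Hub repository ID is a placeholder (the diff never states the model's path), and the 512-token truncation mirrors the maximum sequence length listed under the training hyperparameters.

```python
def load_classifier(model_id: str):
    """Build a text-classification pipeline for the fine-tuned encoder.

    Requires `pip install transformers torch`; imported lazily so the
    rest of this sketch can be used without them installed.
    """
    from transformers import pipeline
    return pipeline("text-classification", model=model_id)


def classify(clf, texts, max_length: int = 512):
    """Score a batch of tweets, truncating inputs to the 512-piece
    maximum sequence length the card lists for training."""
    return clf(list(texts), truncation=True, max_length=max_length)
```

With the library installed, `classify(load_classifier("<hub-id>"), ["Exemplo de tweet."])` returns a list of `{label, score}` dicts; again, `<hub-id>` is a placeholder to be replaced with the actual HateBERTimbau repository path.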
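Taken together, the dataset size and the hyperparameters above pin down the optimizer workload. Assuming plain mini-batch updates with no gradient accumulation and no dropping of the final partial batch (neither is stated in the card), the step counts work out as follows:

```python
import math

DATASET_SIZE = 229_103  # tweets in the fine-tuning set, per the card
BATCH_SIZE = 4          # samples per update, per the card
EPOCHS = 100            # passes over the data, per the card

# One optimizer step per batch; the last, partially filled batch still
# counts as a step, hence the ceiling division.
steps_per_epoch = math.ceil(DATASET_SIZE / BATCH_SIZE)
total_steps = steps_per_epoch * EPOCHS

print(steps_per_epoch)  # 57276
print(total_steps)      # 5727600
```

Roughly 5.7 million Adam updates at batch size 4 is an unusually long schedule, which is worth keeping in mind when reproducing or auditing the training setup.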