IsmaelMousa's picture
update model card-6
c1a5f63 verified
metadata
language:
  - en
license: apache-2.0
tags:
  - pytorch
  - emotion-classification
base_model: bert-base-cased
datasets:
  - dair-ai/emotion
pipeline_tag: text-classification
model-index:
  - name: bert-finetuned-emotion
    results: []
library_name: transformers

bert-finetuned-emotion

This model is a fine-tuned version of bert-base-cased on the emotion dataset. It achieves the following results on the evaluation set:

  • Loss: 0.1656

Model description

The bert-finetuned-emotion model is a fine-tuned version of the BERT model for text classification, specifically trained for emotion classification tasks. It utilizes the BERT architecture, a powerful pre-trained language representation model developed by Google, and fine-tunes it on the dair-ai/emotion dataset. The model aims to predict the emotion associated with a given text input.

Intended uses & limitations

Intended Uses

  • Emotion classification in text: The model can be used to classify the emotions conveyed in textual data, aiding applications such as sentiment analysis, customer feedback analysis, and social media monitoring.
  • Integration into applications: This model can be integrated into various applications and platforms to provide emotion analysis functionalities.

Limitations

  • Domain-specific limitations: The model's performance may vary depending on the domain of the text data. It is primarily trained on general textual data and may not perform optimally on specialized domains.
  • Language limitations: The model is trained primarily on English text and may not generalize well to other languages without further adaptation.
  • Bias and fairness: As with any machine learning model, biases present in the training data may be reflected in the model's predictions. Care should be taken to mitigate biases, especially when deploying the model in sensitive applications.

Training and evaluation data

Dataset

The model is trained on the dair-ai/emotion dataset, which contains text samples labeled with emotions such as love, surprise, joy, sadness, anger and fear. The dataset provides a diverse range of textual expressions of emotions, enabling the model to learn patterns associated with different emotional states.

Data Preprocessing

Before training, the text data undergoes preprocessing steps such as tokenization, lowercasing, and truncation to prepare it for input into the BERT model.

Training procedure

The model is fine-tuned using transfer learning on top of the pre-trained BERT model. During training, the parameters of the BERT model are fine-tuned using backpropagation and gradient descent optimization to minimize a loss function, typically categorical cross-entropy, on the emotion classification task. The fine-tuning process involves adjusting the model's weights based on the labeled examples in the dair-ai/emotion dataset.

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3.0

Training results

Training Loss Epoch Step Validation Loss
0.2653 1.0 2000 0.2193
0.1552 2.0 4000 0.1690
0.1028 3.0 6000 0.1656

Framework versions

  • Transformers 4.40.1
  • Pytorch 2.2.1+cu121
  • Datasets 2.19.1
  • Tokenizers 0.19.1