license: apache-2.0
tags:
- generated_from_trainer
datasets:
- AdamCodd/emotion-balanced
metrics:
- accuracy
- f1
- recall
- precision
base_model: bert-tiny
model-index:
- name: tinybert-emotion-balanced
results:
- task:
type: text-classification
name: Text Classification
dataset:
name: emotion
type: emotion
args: default
metrics:
- type: accuracy
value: 0.9354
name: Accuracy
- type: loss
value: 0.1809
name: Loss
- type: f1
value: 0.9354946613311768
name: F1
tinybert-emotion
This model is a fine-tuned version of bert-tiny on the emotion balanced dataset. It achieves the following results on the evaluation set:
- Loss: 0.1809
- Accuracy: 0.9354
Model description
TinyBERT is 7.5 times smaller and 9.4 times faster on inference compared to its teacher BERT model (while DistilBERT is 40% smaller and 1.6 times faster than BERT). The model has been trained on 89_754 examples split into train, validation and test. Each label was perfectly balanced in each split.
Intended uses & limitations
This model is not as accurate as the distilbert-emotion-balanced one because the focus was on speed, which can lead to misinterpretation of complex sentences. Despite this, its performance is quite good and should be more than sufficient for most use cases.
Usage:
from transformers import pipeline
# Create the pipeline
emotion_classifier = pipeline('text-classification', model='AdamCodd/tinybert-emotion-balanced')
# Now you can use the pipeline to classify emotions
result = emotion_classifier("We are delighted that you will be coming to visit us. It will be so nice to have you here.")
print(result)
#[{'label': 'joy', 'score': 0.9895486831665039}]
This model faces challenges in accurately categorizing negative sentences, as well as those containing elements of sarcasm or irony. These limitations are largely attributable to TinyBERT's constrained capabilities in semantic understanding. Although the model is generally proficient in emotion detection tasks, it may lack the nuance necessary for interpreting complex emotional nuances.
Training and evaluation data
More information needed
Training procedure
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 3e-05
- train_batch_size: 32
- eval_batch_size: 64
- seed: 1270
- optimizer: AdamW with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- lr_scheduler_warmup_steps: 150
- num_epochs: 10
- weight_decay: 0.01
Training results
precision recall f1-score support
sadness 0.9733 0.9245 0.9482 1496
joy 0.9651 0.8864 0.9240 1496
love 0.9127 0.9786 0.9445 1496
anger 0.9479 0.9365 0.9422 1496
fear 0.9213 0.9004 0.9108 1496
surprise 0.9016 0.9866 0.9422 1496
accuracy 0.9355 8976
macro avg 0.9370 0.9355 0.9353 8976
weighted avg 0.9370 0.9355 0.9353 8976
test_acc: 0.9354946613311768
test_loss: 0.1809326708316803
Framework versions
- Transformers 4.33.0
- Pytorch lightning 2.0.8
- Tokenizers 0.13.3
If you want to support me, you can here.