---
license: mit
datasets:
- SkyWater21/lv_emotions
language:
- lv
base_model:
- google-bert/bert-base-multilingual-cased
---
Fine-tuned [Multilingual BERT](https://huggingface.co/google-bert/bert-base-multilingual-cased) for multi-label emotion classification of Latvian text.

The model was trained on the [lv_emotions](https://huggingface.co/datasets/SkyWater21/lv_emotions) dataset, a Latvian translation of the [GoEmotions](https://huggingface.co/datasets/go_emotions) and [Twitter Emotions](https://huggingface.co/datasets/SkyWater21/lv_twitter_emotions) datasets. The translation was machine-generated with Google Translate.

The original 26 emotions were mapped to 6 base emotions, following Dr. Ekman's theory.
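As a rough illustration of that mapping step, the sketch below collapses fine-grained labels into base emotions. The card does not list the exact mapping used for this model, so `EKMAN_EXCERPT` is a hypothetical, partial table whose groupings are assumed to follow the Ekman grouping distributed with GoEmotions; `to_base_emotions` is likewise an illustrative helper name.

```python
# Illustrative excerpt of a fine-grained -> base emotion mapping.
# Assumption: groupings follow the Ekman grouping distributed with GoEmotions;
# the exact mapping used for this model is not stated in the card.
EKMAN_EXCERPT = {
    "annoyance": "anger",
    "disapproval": "anger",
    "nervousness": "fear",
    "amusement": "joy",
    "gratitude": "joy",
    "grief": "sadness",
    "remorse": "sadness",
    "curiosity": "surprise",
    "neutral": "neutral",
}


def to_base_emotions(fine_labels):
    """Collapse fine-grained labels into a sorted list of unique base emotions.

    Raises KeyError for labels that are not part of this excerpt.
    """
    return sorted({EKMAN_EXCERPT[label] for label in fine_labels})


print(to_base_emotions(["annoyance", "disapproval", "amusement"]))
```

Because the task is multi-label, one example can keep several base emotions after the mapping, while fine-grained labels that share a base emotion collapse into one.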

Labels predicted by the classifier:
```yaml
0: anger
1: disgust
2: fear
3: joy
4: sadness
5: surprise
6: neutral
```
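Since the classifier is multi-label, each output logit is scored independently, and several labels (or none) can be predicted for one text. A minimal sketch of decoding a logit vector into labels, assuming a sigmoid per label and a 0.5 decision threshold (the card does not state the threshold used); `decode_logits` and the example logits are hypothetical:

```python
import math

# Index-to-label mapping copied from the label list in this card.
ID2LABEL = {
    0: "anger", 1: "disgust", 2: "fear", 3: "joy",
    4: "sadness", 5: "surprise", 6: "neutral",
}


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def decode_logits(logits, threshold=0.5):
    """Return (label, probability) pairs whose probability reaches the threshold."""
    results = []
    for index, logit in enumerate(logits):
        probability = sigmoid(logit)
        if probability >= threshold:
            results.append((ID2LABEL[index], probability))
    return results


# Made-up logits for one input text, in label-index order.
example_logits = [-2.0, -3.0, -4.0, 1.5, -1.0, 0.2, -0.5]
for label, probability in decode_logits(example_logits):
    print(f"{label}: {probability:.3f}")
```

In practice the logits would come from the model's forward pass, for example via `AutoModelForSequenceClassification` in Transformers, and the threshold can be tuned per label on a validation split.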

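The random number generators were seeded with 42:
```python
import random

import numpy as np
import torch


def set_seed(seed=42):
    # Seed Python, NumPy, and PyTorch (CPU and, if present, all CUDA devices).
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    if torch.cuda.is_available():
        torch.cuda.manual_seed_all(seed)
```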

Training parameters:
```yaml
max_length: null
batch_size: 32
shuffle: True
num_workers: 4
pin_memory: False
drop_last: False
optimizer: adam
lr: 0.00001
weight_decay: 0
problem_type: multi_label_classification
num_epochs: 4
```
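In Hugging Face Transformers, `problem_type: multi_label_classification` makes the sequence-classification head use binary cross-entropy with logits, so every label is treated as an independent yes/no decision. The pure-Python sketch below shows that loss for a single example with mean reduction over labels. It assumes training relied on this built-in loss, which the card does not confirm; `bce_with_logits` is an illustrative name.

```python
import math


def bce_with_logits(logits, targets):
    """Mean binary cross-entropy over labels, computed from raw logits."""
    total = 0.0
    for x, y in zip(logits, targets):
        # Numerically stable form of -[y*log(s) + (1-y)*log(1-s)], s = sigmoid(x):
        # max(x, 0) - x*y + log(1 + exp(-|x|))
        total += max(x, 0.0) - x * y + math.log1p(math.exp(-abs(x)))
    return total / len(logits)


# Made-up logits and multi-hot targets for three labels.
logits = [2.0, -1.0, 0.0]
targets = [1.0, 0.0, 0.0]
print(f"loss = {bce_with_logits(logits, targets):.4f}")
```

A logit of 0 corresponds to a probability of 0.5, so it contributes `log(2)` to the sum regardless of the target.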


Evaluation results on the test split of [lv_go_emotions](https://huggingface.co/datasets/SkyWater21/lv_emotions/viewer/combined/lv_go_emotions_test):

|              |Precision|Recall|F1-Score|Support|
|--------------|---------|------|--------|-------|
|anger         |     0.50|  0.35|    0.41|    726|
|disgust       |     0.44|  0.28|    0.35|    123|
|fear          |     0.58|  0.47|    0.52|     98|
|joy           |     0.80|  0.76|    0.78|   2104|
|sadness       |     0.66|  0.41|    0.51|    379|
|surprise      |     0.59|  0.55|    0.57|    677|
|neutral       |     0.71|  0.43|    0.54|   1787|
|micro avg     |     0.70|  0.55|    0.62|   5894|
|macro avg     |     0.61|  0.46|    0.52|   5894|
|weighted avg  |     0.69|  0.55|    0.61|   5894|
|samples avg   |     0.58|  0.56|    0.57|   5894|

Evaluation results on the test split of [lv_twitter_emotions](https://huggingface.co/datasets/SkyWater21/lv_emotions/viewer/combined/lv_twitter_emotions_test):

|              |Precision|Recall|F1-Score|Support|
|--------------|---------|------|--------|-------|
|anger         |     0.92|  0.88|    0.90|  12013|
|disgust       |     0.90|  0.94|    0.92|  14117|
|fear          |     0.82|  0.67|    0.74|   3342|
|joy           |     0.88|  0.84|    0.86|   5913|
|sadness       |     0.86|  0.75|    0.80|   4786|
|surprise      |     0.94|  0.56|    0.70|   1510|
|neutral       |     0.00|  0.00|    0.00|      0|
|micro avg     |     0.90|  0.85|    0.87|  41681|
|macro avg     |     0.76|  0.66|    0.70|  41681|
|weighted avg  |     0.90|  0.85|    0.87|  41681|
|samples avg   |     0.85|  0.85|    0.85|  41681|
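
The `neutral` row in the lv_twitter_emotions table has zero support, so its 0.00 scores still count toward the macro average. The sketch below recomputes the macro F1 from the rounded per-class values in the table, with and without `neutral`. It assumes the macro average is the unweighted mean over all seven rows, which is consistent with the reported 0.70; `macro_average` is an illustrative helper.

```python
# Per-class F1 scores copied from the lv_twitter_emotions table (rounded to 2 decimals).
f1_scores = {
    "anger": 0.90,
    "disgust": 0.92,
    "fear": 0.74,
    "joy": 0.86,
    "sadness": 0.80,
    "surprise": 0.70,
    "neutral": 0.00,
}


def macro_average(scores, exclude=()):
    """Unweighted mean of per-class scores, optionally skipping some classes."""
    kept = [value for name, value in scores.items() if name not in exclude]
    return sum(kept) / len(kept)


macro_all = macro_average(f1_scores)
macro_without_neutral = macro_average(f1_scores, exclude=("neutral",))
print(f"macro F1 with neutral:    {macro_all:.2f}")
print(f"macro F1 without neutral: {macro_without_neutral:.2f}")
```

The micro and weighted averages are not affected in the same way, because a class with zero support contributes no true labels and no weight to them.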