This model is a fine-tuned version the cardiffnlp/twitter-roberta-base model. It has been trained using a recently published corpus: Shared task on Detecting Signs of Depression from Social Media Text at LT-EDI 2022-ACL 2022.
The obtained macro f1-score is 0.54, on the development set of the competition.
Intended uses
This model is trained to classify the given text into one of the following classes: moderate, severe, or not depressed. It corresponds to a multiclass classification task.
Training and evaluation data
The train dataset characteristics are:
Class | Nº sentences | Avg. document length (in sentences) | Nº words | Avg. sentence length (in words) |
---|---|---|---|---|
not depression | 7,884 | 4 | 153,738 | 78 |
moderate | 36,114 | 6 | 601,900 | 100 |
severe | 9,911 | 11 | 126,140 | 140 |
Similarly, the evaluation dataset characteristics are:
Class | Nº sentences | Avg. document length (in sentences) | Nº words | Avg. sentence length (in words) |
---|---|---|---|---|
not depression | 3,660 | 2 | 10,980 | 6 |
moderate | 66,874 | 29 | 804,794 | 349 |
severe | 2,880 | 8 | 75,240 | 209 |
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 2e-05
- evaluation_strategy: epoch
- save_strategy: epoch
- per_device_train_batch_size: 8
- per_device_eval_batch_size: 8
- num_train_epochs: 5
- seed: 10
- weight_decay: 0.01
- metric_for_best_model: macro-f1