paulagarciaserrano's picture
Update README.md
4922306
|
raw
history blame
2.16 kB

This model is a fine-tuned version the cardiffnlp/twitter-roberta-base model. It has been trained using a recently published corpus: Shared task on Detecting Signs of Depression from Social Media Text at LT-EDI 2022-ACL 2022.

The obtained macro f1-score is 0.54, on the development set of the competition.

Intended uses

This model is trained to classify the given text into one of the following classes: moderate, severe, or not depressed. It corresponds to a multiclass classification task.

Training and evaluation data

The train dataset characteristics are:

Class Nº sentences Avg. document length (in sentences) Nº words Avg. sentence length (in words)
not depression 7,884 4 153,738 78
moderate 36,114 6 601,900 100
severe 9,911 11 126,140 140

Similarly, the evaluation dataset characteristics are:

Class Nº sentences Avg. document length (in sentences) Nº words Avg. sentence length (in words)
not depression 3,660 2 10,980 6
moderate 66,874 29 804,794 349
severe 2,880 8 75,240 209

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-05
  • evaluation_strategy: epoch
  • save_strategy: epoch
  • per_device_train_batch_size: 8
  • per_device_eval_batch_size: 8
  • num_train_epochs: 5
  • seed: 10
  • weight_decay: 0.01
  • metric_for_best_model: macro-f1