paulagarciaserrano
/

roberta-depression-detection

Text Classification

Inference Endpoints

Model card Files Files and versions Community

roberta-depression-detection / README.md

paulagarciaserrano's picture

paulagarciaserrano

Update README.md

4922306 over 2 years ago

|

2.16 kB

	This model is a fine-tuned version the <a href="https://huggingface.co/cardiffnlp/twitter-roberta-base">cardiffnlp/twitter-roberta-base</a> model. It has been trained using a recently published corpus: <a href="https://competitions.codalab.org/competitions/36410#learn_the_details">Shared task on Detecting Signs of Depression from Social Media Text at LT-EDI 2022-ACL 2022</a>.

	The obtained macro f1-score is 0.54, on the development set of the competition.

	# Intended uses
	This model is trained to classify the given text into one of the following classes: moderate, severe, or not depressed.
	It corresponds to a multiclass classification task.

	# Training and evaluation data
	The train dataset characteristics are:

	<table>
	<tr>
	<th>Class</th>
	<th>Nº sentences</th>
	<th>Avg. document length (in sentences)</th>
	<th>Nº words</th>
	<th>Avg. sentence length (in words)</th>
	</tr>
	<tr>
	<th>not depression</th>
	<td>7,884</td>
	<td>4</td>
	<td>153,738</td>
	<td>78</td>
	</tr>
	<tr>
	<th>moderate</th>
	<td>36,114</td>
	<td>6</td>
	<td>601,900</td>
	<td>100</td>
	</tr>
	<tr>
	<th>severe</th>
	<td>9,911</td>
	<td>11</td>
	<td>126,140</td>
	<td>140</td>
	</tr>
	</table>

	Similarly, the evaluation dataset characteristics are:

	<table>
	<tr>
	<th>Class</th>
	<th>Nº sentences</th>
	<th>Avg. document length (in sentences)</th>
	<th>Nº words</th>
	<th>Avg. sentence length (in words)</th>
	</tr>
	<tr>
	<th>not depression</th>
	<td>3,660</td>
	<td>2</td>
	<td>10,980</td>
	<td>6</td>
	</tr>
	<tr>
	<th>moderate</th>
	<td>66,874</td>
	<td>29</td>
	<td>804,794</td>
	<td>349</td>
	</tr>
	<tr>
	<th>severe</th>
	<td>2,880</td>
	<td>8</td>
	<td>75,240</td>
	<td>209</td>
	</tr>
	</table>

	# Training hyperparameters
	The following hyperparameters were used during training:

	* learning_rate: 2e-05
	* evaluation_strategy: epoch
	* save_strategy: epoch
	* per_device_train_batch_size: 8
	* per_device_eval_batch_size: 8
	* num_train_epochs: 5
	* seed: 10
	* weight_decay: 0.01
	* metric_for_best_model: macro-f1