license: apache-2.0
tags:
- text-classification
- depression
- reddit
- generated_from_trainer
datasets:
- mrjunos/depression-reddit-cleaned
metrics:
- accuracy
widget:
- text:
- >-
i just found out my boyfriend is depressed i really want to be there for
him but i feel like i ve only been saying the wrong thing how can i be
there for him help him and see him get better i m worried it will
continue to the point it will consume him i can already see his
personality changing and i m scared for the future what thing can i say
or do to comfort or help
example_title: depression
- text:
- >-
i m getting more and more people asking where they can buy the ambients
album simple answer is quot not yet quot it ll be on itunes eventually
example_title: not_depression
model-index:
- name: depression-reddit-distilroberta-base
results:
- task:
name: Text Classification
type: text-classification
dataset:
name: mrjunos/depression-reddit-cleaned
type: depression-reddit-cleaned
config: default
split: train
args: default
metrics:
- name: Accuracy
type: accuracy
value: 0.9715578539107951
language:
- en
pipeline_tag: text-classification
Example Pipeline
from transformers import pipeline
predict_task = pipeline(model="mrjunos/depression-reddit-distilroberta-base", task="text-classification")
predict_task("Stop listing your issues here, use forum instead or open ticket.")
[{'label': 'not_depression', 'score': 0.9813856482505798}]
Disclaimer: This machine learning model classifies texts related to depression, but I am not an expert or a mental health professional. I do not intend to diagnose or offer medical advice. The information provided should not replace consultation with a qualified professional. The results may not be accurate. Use this model at your own risk and seek professional advice if needed.
This model is a fine-tuned version of distilroberta-base on the mrjunos/depression-reddit-cleaned dataset. It achieves the following results on the evaluation set:
- Loss: 0.0821
- Accuracy: 0.9716
Model description
This model is a transformer-based model that has been fine-tuned on a dataset of Reddit posts related to depression. The model can be used to classify posts as either depression or not depression.
Intended uses & limitations
This model is intended to be used for research purposes. It is not yet ready for production use. The model has been trained on a dataset of English-language posts, so it may not be accurate for other languages.
Training and evaluation data
The model was trained on the mrjunos/depression-reddit-cleaned dataset, which contains approximately 7,000 labeled instances. The data was split into Train and Test using:
ds = ds['train'].train_test_split(test_size=0.2, seed=42)
The dataset consists of two main features: 'text' and 'label'. The 'text' feature contains the text data from Reddit posts related to depression, while the 'label' feature indicates whether a post is classified as depression or not.
Training procedure
You can find here the steps I followed to train this model: https://github.com/mrjunos/machine_learning/blob/main/NLP-fine_tunning-hugging_face_model.ipynb
Training hyperparameters
The following hyperparameters were used during training:
- learning_rate: 5e-05
- train_batch_size: 8
- eval_batch_size: 8
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 3
Training results
Training Loss | Epoch | Step | Validation Loss | Accuracy |
---|---|---|---|---|
0.1711 | 0.65 | 500 | 0.0821 | 0.9716 |
0.1022 | 1.29 | 1000 | 0.1148 | 0.9709 |
0.0595 | 1.94 | 1500 | 0.1178 | 0.9787 |
0.0348 | 2.59 | 2000 | 0.0951 | 0.9851 |
Framework versions
- Transformers 4.30.2
- Pytorch 2.0.1+cu118
- Datasets 2.13.0
- Tokenizers 0.13.3