Update README.md
Browse files
README.md
CHANGED
@@ -37,6 +37,17 @@ should probably proofread and complete it, then remove this comment. -->
|
|
37 |
|
38 |
# depression-reddit-distilroberta-base
|
39 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
40 |
This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the mrjunos/depression-reddit-cleaned dataset.
|
41 |
It achieves the following results on the evaluation set:
|
42 |
- Loss: 0.0821
|
@@ -55,7 +66,10 @@ The model has been trained on a dataset of English-language posts, so it may not
|
|
55 |
## Training and evaluation data
|
56 |
|
57 |
The model was trained on the mrjunos/depression-reddit-cleaned dataset, which contains approximately 7,000 labeled instances.
|
58 |
-
The data was split into Train and Test using
|
|
|
|
|
|
|
59 |
The dataset consists of two main features: 'text' and 'label'. The 'text' feature contains the text data from Reddit posts related to depression, while the 'label' feature indicates whether a post is classified as depression or not.
|
60 |
|
61 |
## Training procedure
|
|
|
37 |
|
38 |
# depression-reddit-distilroberta-base
|
39 |
|
40 |
+
## Example Pipeline
|
41 |
+
|
42 |
+
```python
|
43 |
+
from transformers import pipeline
|
44 |
+
predict_task = pipeline(model="mrjunos/depression-reddit-distilroberta-base", task="text-classification")
|
45 |
+
predict_task("Stop listing your issues here, use forum instead or open ticket.")
|
46 |
+
```
|
47 |
+
```
|
48 |
+
[{'label': 'not_depression', 'score': 0.9813856482505798}]
|
49 |
+
```
|
50 |
+
|
51 |
This model is a fine-tuned version of [distilroberta-base](https://huggingface.co/distilroberta-base) on the mrjunos/depression-reddit-cleaned dataset.
|
52 |
It achieves the following results on the evaluation set:
|
53 |
- Loss: 0.0821
|
|
|
66 |
## Training and evaluation data
|
67 |
|
68 |
The model was trained on the mrjunos/depression-reddit-cleaned dataset, which contains approximately 7,000 labeled instances.
|
69 |
+
The data was split into Train and Test using:
|
70 |
+
```python
|
71 |
+
ds = ds['train'].train_test_split(test_size=0.2, seed=42)
|
72 |
+
```
|
73 |
The dataset consists of two main features: 'text' and 'label'. The 'text' feature contains the text data from Reddit posts related to depression, while the 'label' feature indicates whether a post is classified as depression or not.
|
74 |
|
75 |
## Training procedure
|