Update README.md
Browse files
README.md
CHANGED
@@ -9,14 +9,41 @@ tags:
|
|
9 |
license: apache-2.0
|
10 |
language:
|
11 |
- en
|
|
|
|
|
|
|
|
|
12 |
---
|
13 |
|
14 |
-
#
|
15 |
|
16 |
-
-
|
17 |
-
- **License:** apache-2.0
|
18 |
-
- **Finetuned from model :** unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
|
19 |
|
20 |
-
|
|
|
|
|
|
|
|
|
21 |
|
22 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
license: apache-2.0
|
10 |
language:
|
11 |
- en
|
12 |
+
datasets:
|
13 |
+
- Amod/mental_health_counseling_conversations
|
14 |
+
metrics:
|
15 |
+
- accuracy
|
16 |
---
|
17 |
|
18 |
+
# Model Description
|
19 |
|
20 |
+
This model is a fine-tuned version of unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit, specifically tailored for mental health counseling tasks. It has been trained on the Amod/mental_health_counseling_conversations dataset for 10 epochs using two H100 GPUs.
|
|
|
|
|
21 |
|
22 |
+
## Key Features
|
23 |
+
* Base Model: Utilizes the DeepSeek-R1 architecture, known for its powerful reasoning capabilities13.
|
24 |
+
* Distillation: Leverages knowledge distillation techniques to compress the larger DeepSeek-R1 model into a more efficient 8B parameter Llama-based version13.
|
25 |
+
* Quantization: Employs Unsloth's dynamic 4-bit quantization for reduced memory footprint and faster inference59.
|
26 |
+
* Domain Specialization: Fine-tuned on a dataset of mental health counseling conversations, enhancing its ability to understand and respond to mental health-related queries68.
|
27 |
|
28 |
+
## Training Details
|
29 |
+
* Dataset: Amod/mental_health_counseling_conversations, containing 3,512 Q&A pairs from counseling platforms68.
|
30 |
+
* Training Duration: 10 epochs
|
31 |
+
* Hardware: Two H100 GPUs
|
32 |
+
|
33 |
+
## Potential Applications
|
34 |
+
|
35 |
+
This model could be particularly useful for:
|
36 |
+
* Prototyping mental health chatbots
|
37 |
+
* Assisting in mental health research
|
38 |
+
* Providing initial screening or support in mental health contexts
|
39 |
+
|
40 |
+
### Limitations and Ethical Considerations
|
41 |
+
|
42 |
+
While this model has been trained on mental health counseling data, it's crucial to note:
|
43 |
+
* It should not replace professional mental health care or diagnosis.
|
44 |
+
* The model may have biases or limitations based on its training data.
|
45 |
+
* Ethical use and privacy considerations are paramount when dealing with sensitive mental health information.
|
46 |
+
|
47 |
+
|
48 |
+
|
49 |
+
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="150"/>](https://github.com/unslothai/unsloth)
|