vivirocks commited on
Commit
95833e0
·
verified ·
1 Parent(s): 2c497c0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -6
README.md CHANGED
@@ -9,14 +9,41 @@ tags:
9
  license: apache-2.0
10
  language:
11
  - en
 
 
 
 
12
  ---
13
 
14
- # Uploaded model
15
 
16
- - **Developed by:** vivirocks
17
- - **License:** apache-2.0
18
- - **Finetuned from model :** unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit
19
 
20
- This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
 
 
 
21
 
22
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  license: apache-2.0
10
  language:
11
  - en
12
+ datasets:
13
+ - Amod/mental_health_counseling_conversations
14
+ metrics:
15
+ - accuracy
16
  ---
17
 
18
+ # Model Description
19
 
20
+ This model is a fine-tuned version of unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit, specifically tailored for mental health counseling tasks. It has been trained on the Amod/mental_health_counseling_conversations dataset for 10 epochs using two H100 GPUs.
 
 
21
 
22
+ ## Key Features
23
+ * Base Model: Utilizes the DeepSeek-R1 architecture, known for its powerful reasoning capabilities13.
24
+ * Distillation: Leverages knowledge distillation techniques to compress the larger DeepSeek-R1 model into a more efficient 8B parameter Llama-based version13.
25
+ * Quantization: Employs Unsloth's dynamic 4-bit quantization for reduced memory footprint and faster inference59.
26
+ * Domain Specialization: Fine-tuned on a dataset of mental health counseling conversations, enhancing its ability to understand and respond to mental health-related queries68.
27
 
28
+ ## Training Details
29
+ * Dataset: Amod/mental_health_counseling_conversations, containing 3,512 Q&A pairs from counseling platforms68.
30
+ * Training Duration: 10 epochs
31
+ * Hardware: Two H100 GPUs
32
+
33
+ ## Potential Applications
34
+
35
+ This model could be particularly useful for:
36
+ * Prototyping mental health chatbots
37
+ * Assisting in mental health research
38
+ * Providing initial screening or support in mental health contexts
39
+
40
+ ### Limitations and Ethical Considerations
41
+
42
+ While this model has been trained on mental health counseling data, it's crucial to note:
43
+ * It should not replace professional mental health care or diagnosis.
44
+ * The model may have biases or limitations based on its training data.
45
+ * Ethical use and privacy considerations are paramount when dealing with sensitive mental health information.
46
+
47
+
48
+
49
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="150"/>](https://github.com/unslothai/unsloth)