scb10x/typhoon2-safety-preview

@@ -11,34 +11,86 @@ Typhoon Safety is a lightweight binary classifier designed to detect harmful con
 Trained on a mix of the Thai Sensitive Topics dataset and Wildguard.
-### Thai Sensitive Topics Distribution
-| Category | English Samples | Thai Samples |
-|----------|----------------|--------------|
-| The Monarchy | 1,380 | 352 |
-| Gambling | 1,075 | 264 |
-| Cannabis | 818 | 201 |
-| Drug Policies | 448 | 111 |
-| Thai-Burmese Border Issues | 442 | 119 |
-| Military and Coup d'États | 297 | 72 |
-| LGBTQ+ Rights | 275 | 75 |
-| Religion and Buddhism | 252 | 57 |
-| Political Corruption | 237 | 58 |
-| Freedom of Speech and Censorship | 218 | 56 |
-| National Identity and Immigration | 216 | 57 |
-| Southern Thailand Insurgency | 211 | 56 |
-| Sex Tourism and Prostitution | 198 | 55 |
-| Student Protests and Activism | 175 | 44 |
-| Cultural Appropriation | 171 | 42 |
-| Human Trafficking | 158 | 39 |
-| Political Divide | 156 | 43 |
-| Foreign Influence | 124 | 30 |
-| Vape | 127 | 24 |
-| COVID-19 Management | 105 | 27 |
-| Migrant Labor Issues | 79 | 23 |
-| Royal Projects and Policies | 55 | 17 |
-| Environmental Issues and Land Rights | 19 | 5 |
-| **Total** | **9,321** | **4,563** |
 ## Model Details
@@ -68,12 +120,10 @@ Train on mixed of Thai Sensitive topic dataset and Wildguard.
 - **Developed by:** [More Information Needed]
-- **Funded by [optional]:** [More Information Needed]
-- **Shared by [optional]:** [More Information Needed]
-- **Model type:** [More Information Needed]
-- **Language(s) (NLP):** [More Information Needed]
-- **License:** [More Information Needed]
-- **Finetuned from model [optional]:** [More Information Needed]
 ## How to Get Started with the Model

 Trained on a mix of the Thai Sensitive Topics dataset and Wildguard.
+This model is trained to predict safety labels for the categories below.
+<div class="section-header">Thai Sensitive Topics</div>
+<table align="center">
+  <tr>
+    <th colspan="3">Category</th>
+  </tr>
+  <tr>
+    <td>The Monarchy</td>
+    <td>Student Protests and Activism</td>
+    <td>Drug Policies</td>
+  </tr>
+  <tr>
+    <td>Gambling</td>
+    <td>Cultural Appropriation</td>
+    <td>Thai-Burmese Border Issues</td>
+  </tr>
+  <tr>
+    <td>Cannabis</td>
+    <td>Human Trafficking</td>
+    <td>Military and Coup d'États</td>
+  </tr>
+  <tr>
+    <td>LGBTQ+ Rights</td>
+    <td>Political Divide</td>
+    <td>Religion and Buddhism</td>
+  </tr>
+  <tr>
+    <td>Political Corruption</td>
+    <td>Foreign Influence</td>
+    <td>National Identity and Immigration</td>
+  </tr>
+  <tr>
+    <td>Freedom of Speech and Censorship</td>
+    <td>Vape</td>
+    <td>Southern Thailand Insurgency</td>
+  </tr>
+  <tr>
+    <td>Sex Tourism and Prostitution</td>
+    <td>COVID-19 Management</td>
+    <td>Royal Projects and Policies</td>
+  </tr>
+  <tr>
+    <td>Migrant Labor Issues</td>
+    <td>Environmental Issues and Land Rights</td>
+    <td></td>
+  </tr>
+</table>
+<div class="section-header">Wildguard Topics</div>
+<table>
+  <tr>
+    <th colspan="3">Category</th>
+  </tr>
+  <tr>
+    <td>Others</td>
+    <td>Sensitive Information Organization</td>
+    <td>Mental Health Over-reliance Crisis</td>
+  </tr>
+  <tr>
+    <td>Social Stereotypes & Discrimination</td>
+    <td>Defamation & Unethical Actions</td>
+    <td>Cyberattack</td>
+  </tr>
+  <tr>
+    <td>Disseminating False Information</td>
+    <td>Private Information Individual</td>
+    <td>Copyright Violations</td>
+  </tr>
+  <tr>
+    <td>Toxic Language & Hate Speech</td>
+    <td>Fraud Assisting Illegal Activities</td>
+    <td>Causing Material Harm by Misinformation</td>
+  </tr>
+  <tr>
+    <td>Violence and Physical Harm</td>
+    <td>Sexual Content</td>
+    <td></td>
+  </tr>
+</table>
 ## Model Details
 - **Developed by:** [More Information Needed]
+- **Model type:** Transformer Encoder
+- **Language(s) (NLP):** Thai 🇹🇭 and English 🇬🇧
+- **License:** MIT
+- **Finetuned from model [optional]:** [mDeBERTa v3 base](https://huggingface.co/microsoft/mdeberta-v3-base)
 ## How to Get Started with the Model
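
The diff leaves this section empty, so the following is only a minimal sketch of how a binary classifier fine-tuned from mDeBERTa v3 base might be queried. It is not taken from the model card. It assumes the checkpoint loads through the `transformers` sequence-classification classes and that the repository id matches the page header. The helper names `softmax` and `classify` are hypothetical, and label names are read from the model config rather than assumed.

```python
import math
from typing import List, Sequence

# Repository id taken from this page's header; treat it as an assumption.
MODEL_ID = "scb10x/typhoon2-safety-preview"


def softmax(logits: Sequence[float]) -> List[float]:
    """Numerically stable softmax over one row of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify(texts: List[str], model_id: str = MODEL_ID) -> List[dict]:
    """Score each text and return its top label and probability.

    Assumption: the checkpoint is a sequence-classification model.
    Label names come from model.config.id2label, not from this sketch.
    """
    # Imported lazily so softmax() works without these packages installed.
    import torch
    from transformers import AutoModelForSequenceClassification, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSequenceClassification.from_pretrained(model_id)
    model.eval()

    # Thai and English inputs can share one padded batch.
    batch = tokenizer(texts, return_tensors="pt", padding=True, truncation=True)
    with torch.no_grad():
        logits = model(**batch).logits

    results = []
    for text, row in zip(texts, logits.tolist()):
        probs = softmax(row)
        best = max(range(len(probs)), key=probs.__getitem__)
        results.append(
            {"text": text, "label": model.config.id2label[best], "score": probs[best]}
        )
    return results
```

Calling `classify(["How do I bake bread?"])` would download the checkpoint on first use and return one dictionary per input. Which label index corresponds to harmful content is not stated in this diff, so check `model.config.id2label` before thresholding on the score.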