Improve model card and add paper abstract
#1
by
nielsr
HF staff
- opened
README.md
CHANGED
@@ -2,19 +2,20 @@
|
|
2 |
language:
|
3 |
- vi
|
4 |
library_name: transformers
|
|
|
|
|
5 |
tags:
|
6 |
- SemViQA
|
7 |
- three-class-classification
|
8 |
- fact-checking
|
9 |
-
|
10 |
-
license: mit
|
11 |
---
|
12 |
|
13 |
# SemViQA-TC: Vietnamese Three-class Classification for Claim Verification
|
14 |
|
15 |
## Model Description
|
16 |
|
17 |
-
**SemViQA-TC** is one of the key components of the **SemViQA** system, designed for **three-class classification** in Vietnamese fact-checking. This model classifies a given claim into one of three categories: **SUPPORTED**, **REFUTED**, or **NOT ENOUGH INFORMATION (NEI)** based on retrieved evidence.
|
18 |
|
19 |
### **Model Information**
|
20 |
- **Developed by:** [SemViQA Research Team](https://huggingface.co/SemViQA)
|
@@ -23,7 +24,14 @@ license: mit
|
|
23 |
- **Task:** Three-Class Classification (Fact Verification)
|
24 |
- **Dataset:** [ISE-DSC01](https://codalab.lisn.upsaclay.fr/competitions/15497)
|
25 |
|
26 |
-
SemViQA-TC serves as the **first step in the two-step classification process** of the SemViQA system. It initially categorizes claims into three classes: **SUPPORTED, REFUTED, or NEI**. For claims classified as **SUPPORTED** or **REFUTED**, a secondary **binary classification model (SemViQA-BC)** further refines the prediction. This hierarchical classification strategy enhances the accuracy of fact verification.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
27 |
|
28 |
## Usage Example
|
29 |
|
|
|
2 |
language:
|
3 |
- vi
|
4 |
library_name: transformers
|
5 |
+
license: mit
|
6 |
+
pipeline_tag: text-classification
|
7 |
tags:
|
8 |
- SemViQA
|
9 |
- three-class-classification
|
10 |
- fact-checking
|
11 |
+
hf_hub_url: SemViQA/tc-infoxlm-isedsc01
|
|
|
12 |
---
|
13 |
|
14 |
# SemViQA-TC: Vietnamese Three-class Classification for Claim Verification
|
15 |
|
16 |
## Model Description
|
17 |
|
18 |
+
**SemViQA-TC** is one of the key components of the **SemViQA** system, designed for **three-class classification** in Vietnamese fact-checking. This model classifies a given claim into one of three categories: **SUPPORTED**, **REFUTED**, or **NOT ENOUGH INFORMATION (NEI)** based on retrieved evidence. This model contributes to addressing the growing need for robust fact-checking solutions, particularly for low-resource languages like Vietnamese, where existing methods often struggle with semantic nuances and complex linguistic structures. SemViQA aims to balance precision and speed in fact verification.
|
19 |
|
20 |
### **Model Information**
|
21 |
- **Developed by:** [SemViQA Research Team](https://huggingface.co/SemViQA)
|
|
|
24 |
- **Task:** Three-Class Classification (Fact Verification)
|
25 |
- **Dataset:** [ISE-DSC01](https://codalab.lisn.upsaclay.fr/competitions/15497)
|
26 |
|
27 |
+
SemViQA-TC serves as the **first step in the two-step classification process** of the SemViQA system. It initially categorizes claims into three classes: **SUPPORTED, REFUTED, or NEI**. For claims classified as **SUPPORTED** or **REFUTED**, a secondary **binary classification model (SemViQA-BC)** further refines the prediction. This hierarchical classification strategy enhances the accuracy of fact verification. This approach aims to achieve state-of-the-art results by combining Semantic-based Evidence Retrieval (SER) and Two-step Verdict Classification (TVC).
|
28 |
+
|
29 |
+
### **Model Achievements**
|
30 |
+
- **1st place** in the **UIT Data Science Challenge** 🏅
|
31 |
+
- **State-of-the-art** performance on:
|
32 |
+
- **ISE-DSC01** → **78.97% strict accuracy**
|
33 |
+
- **ViWikiFC** → **80.82% strict accuracy**
|
34 |
+
- **SemViQA Faster**: **7x speed improvement** over the standard model 🚀
|
35 |
|
36 |
## Usage Example
|
37 |
|