Improve model card and add paper abstract

#1
by nielsr HF staff - opened
Files changed (1) hide show
  1. README.md +12 -4
README.md CHANGED
@@ -2,19 +2,20 @@
2
  language:
3
  - vi
4
  library_name: transformers
 
 
5
  tags:
6
  - SemViQA
7
  - three-class-classification
8
  - fact-checking
9
- pipeline_tag: text-classification
10
- license: mit
11
  ---
12
 
13
  # SemViQA-TC: Vietnamese Three-class Classification for Claim Verification
14
 
15
  ## Model Description
16
 
17
- **SemViQA-TC** is one of the key components of the **SemViQA** system, designed for **three-class classification** in Vietnamese fact-checking. This model classifies a given claim into one of three categories: **SUPPORTED**, **REFUTED**, or **NOT ENOUGH INFORMATION (NEI)** based on retrieved evidence.
18
 
19
  ### **Model Information**
20
  - **Developed by:** [SemViQA Research Team](https://huggingface.co/SemViQA)
@@ -23,7 +24,14 @@ license: mit
23
  - **Task:** Three-Class Classification (Fact Verification)
24
  - **Dataset:** [ISE-DSC01](https://codalab.lisn.upsaclay.fr/competitions/15497)
25
 
26
- SemViQA-TC serves as the **first step in the two-step classification process** of the SemViQA system. It initially categorizes claims into three classes: **SUPPORTED, REFUTED, or NEI**. For claims classified as **SUPPORTED** or **REFUTED**, a secondary **binary classification model (SemViQA-BC)** further refines the prediction. This hierarchical classification strategy enhances the accuracy of fact verification.
 
 
 
 
 
 
 
27
 
28
  ## Usage Example
29
 
 
2
  language:
3
  - vi
4
  library_name: transformers
5
+ license: mit
6
+ pipeline_tag: text-classification
7
  tags:
8
  - SemViQA
9
  - three-class-classification
10
  - fact-checking
11
+ hf_hub_url: SemViQA/tc-infoxlm-isedsc01
 
12
  ---
13
 
14
  # SemViQA-TC: Vietnamese Three-class Classification for Claim Verification
15
 
16
  ## Model Description
17
 
18
+ **SemViQA-TC** is one of the key components of the **SemViQA** system, designed for **three-class classification** in Vietnamese fact-checking. This model classifies a given claim into one of three categories: **SUPPORTED**, **REFUTED**, or **NOT ENOUGH INFORMATION (NEI)** based on retrieved evidence. This model contributes to addressing the growing need for robust fact-checking solutions, particularly for low-resource languages like Vietnamese, where existing methods often struggle with semantic nuances and complex linguistic structures. SemViQA aims to balance precision and speed in fact verification.
19
 
20
  ### **Model Information**
21
  - **Developed by:** [SemViQA Research Team](https://huggingface.co/SemViQA)
 
24
  - **Task:** Three-Class Classification (Fact Verification)
25
  - **Dataset:** [ISE-DSC01](https://codalab.lisn.upsaclay.fr/competitions/15497)
26
 
27
+ SemViQA-TC serves as the **first step in the two-step classification process** of the SemViQA system. It initially categorizes claims into three classes: **SUPPORTED, REFUTED, or NEI**. For claims classified as **SUPPORTED** or **REFUTED**, a secondary **binary classification model (SemViQA-BC)** further refines the prediction. This hierarchical classification strategy enhances the accuracy of fact verification. This approach aims to achieve state-of-the-art results by combining Semantic-based Evidence Retrieval (SER) and Two-step Verdict Classification (TVC).
28
+
29
+ ### **Model Achievements**
30
+ - **1st place** in the **UIT Data Science Challenge** 🏅
31
+ - **State-of-the-art** performance on:
32
+ - **ISE-DSC01** → **78.97% strict accuracy**
33
+ - **ViWikiFC** → **80.82% strict accuracy**
34
+ - **SemViQA Faster**: **7x speed improvement** over the standard model 🚀
35
 
36
  ## Usage Example
37