lighteternal commited on
Commit
f730ab2
1 Parent(s): be45a88

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -3
README.md CHANGED
@@ -21,17 +21,20 @@ pipeline_tag: text-classification
21
  license: apache-2.0
22
  ---
23
 
24
- # Binary classifier to distinguish between fact (1) or opinion (0) trained on a mixed Greek and English annotated dataset.
25
  ### By the Hellenic Army Academy (SSE) and the Technical University of Crete (TUC)
26
 
27
 
28
- This is an XLM-Roberta-base model with a binary classification head. Given a sentence it can classify it either as a fact or an opinion, based on its content.
 
29
 
30
  HuggingFace API labels:
31
  * Label 0: Opinion/Subjective sentence
32
  * Label 1: Fact/Objective sentence
33
 
34
- The original dataset (available here: https://github.com/1024er/cbert_aug/tree/crayon/datasets/subj) containing aprox. 9000 annotated sentences (classified as subjective or objective) was translated to Greek using Google Translate. The Greek version was concatenated with the original English one to create the mixed EN-EL dataset.
 
 
35
 
36
  ## Evaluation Results on test set
37
 
 
21
  license: apache-2.0
22
  ---
23
 
24
+ # Fact vs. opinion binary classifier, trained on a mixed EN-EL annotated corpus.
25
  ### By the Hellenic Army Academy (SSE) and the Technical University of Crete (TUC)
26
 
27
 
28
+ This is an XLM-Roberta-base model with a binary classification head. Given a sentence, it can classify it either as a fact or an opinion based on its content.
29
+ You can use this model in any of the XLM-R supported languages for the same task, taking advantage of its 0-shot learning capabilities.
30
 
31
  HuggingFace API labels:
32
  * Label 0: Opinion/Subjective sentence
33
  * Label 1: Fact/Objective sentence
34
 
35
+ The original dataset (available here: https://github.com/1024er/cbert_aug/tree/crayon/datasets/subj) contained aprox. 9000 annotated sentences (classified as subjective or objective). It was translated to Greek using Google Translate. The Greek version was then concatenated with the original English one to create the mixed EN-EL dataset.
36
+
37
+ The model was trained for 5 epochs, using batch size = 8. Detailed metrics and hyperparameters available on the "Metrics" tab.
38
 
39
  ## Evaluation Results on test set
40