lighteternal
/

fact-or-opinion-xlmr-el

Text Classification

fact-or-opinion

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

lighteternal commited on Feb 27, 2022

Commit

dec7c0c

•

1 Parent(s): f730ab2

Update README.md

Files changed (1) hide show

README.md +6 -2

README.md CHANGED Viewed

@@ -26,12 +26,15 @@ license: apache-2.0
 This is an XLM-Roberta-base model with a binary classification head. Given a sentence, it can classify it either as a fact or an opinion based on its content.
-You can use this model in any of the XLM-R supported languages for the same task, taking advantage of its 0-shot learning capabilities.
-HuggingFace API labels:
 * Label 0: Opinion/Subjective sentence
 * Label 1: Fact/Objective sentence
 The original dataset (available here: https://github.com/1024er/cbert_aug/tree/crayon/datasets/subj) contained aprox. 9000 annotated sentences (classified as subjective or objective). It was translated to Greek using Google Translate. The Greek version was then concatenated with the original English one to create the mixed EN-EL dataset.
 The model was trained for 5 epochs, using batch size = 8. Detailed metrics and hyperparameters available on the "Metrics" tab.
@@ -44,5 +47,6 @@ The model was trained for 5 epochs, using batch size = 8. Detailed metrics and h
 ## Acknowledgement
 The research work was supported by the Hellenic Foundation for Research and Innovation (HFRI) under the HFRI PhD Fellowship grant (Fellowship Number:50, 2nd call)

 This is an XLM-Roberta-base model with a binary classification head. Given a sentence, it can classify it either as a fact or an opinion based on its content.
+You can use this model in any of the XLM-R supported languages for the same task, taking advantage of its 0-shot learning capabilities. However, the model was trained only using English and Greek sentences.
+Legend of HuggingFace API labels:
 * Label 0: Opinion/Subjective sentence
 * Label 1: Fact/Objective sentence
+## Dataset training info
 The original dataset (available here: https://github.com/1024er/cbert_aug/tree/crayon/datasets/subj) contained aprox. 9000 annotated sentences (classified as subjective or objective). It was translated to Greek using Google Translate. The Greek version was then concatenated with the original English one to create the mixed EN-EL dataset.
 The model was trained for 5 epochs, using batch size = 8. Detailed metrics and hyperparameters available on the "Metrics" tab.
 ## Acknowledgement
 The research work was supported by the Hellenic Foundation for Research and Innovation (HFRI) under the HFRI PhD Fellowship grant (Fellowship Number:50, 2nd call)