stulcrad
/

CNEC2_0_Supertypes_xlm-roberta-large

@@ -1,6 +1,6 @@
 ---
-license: apache-2.0
-base_model: distilbert/distilbert-base-multilingual-cased
 tags:
 - generated_from_trainer
 datasets:
@@ -25,16 +25,16 @@ model-index:
     metrics:
     - name: Precision
       type: precision
-      value: 0.7557829181494662
     - name: Recall
       type: recall
-      value: 0.819980694980695
     - name: F1
       type: f1
-      value: 0.7865740740740742
     - name: Accuracy
       type: accuracy
-      value: 0.9568269568269568
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -42,13 +42,13 @@ should probably proofread and complete it, then remove this comment. -->
 # CNEC2_0_Supertypes_xlm-roberta-large
-This model is a fine-tuned version of [distilbert/distilbert-base-multilingual-cased](https://huggingface.co/distilbert/distilbert-base-multilingual-cased) on the cnec dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2049
-- Precision: 0.7558
-- Recall: 0.8200
-- F1: 0.7866
-- Accuracy: 0.9568
 ## Model description
@@ -68,12 +68,12 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 16
-- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_ratio: 0.01
 - lr_scheduler_warmup_steps: 1000
 - num_epochs: 10
@@ -81,15 +81,24 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.7025        | 1.11  | 500  | 0.2950          | 0.5066    | 0.5927 | 0.5463 | 0.9128   |
-| 0.2152        | 2.22  | 1000 | 0.2057          | 0.6733    | 0.7539 | 0.7113 | 0.9425   |
-| 0.1366        | 3.33  | 1500 | 0.1680          | 0.7228    | 0.7891 | 0.7545 | 0.9525   |
-| 0.0849        | 4.44  | 2000 | 0.1710          | 0.7246    | 0.7987 | 0.7599 | 0.9540   |
-| 0.0574        | 5.56  | 2500 | 0.1725          | 0.7309    | 0.8166 | 0.7714 | 0.9558   |
-| 0.0384        | 6.67  | 3000 | 0.1855          | 0.7327    | 0.8243 | 0.7758 | 0.9554   |
-| 0.0292        | 7.78  | 3500 | 0.1944          | 0.7557    | 0.8287 | 0.7905 | 0.9573   |
-| 0.0208        | 8.89  | 4000 | 0.2053          | 0.7486    | 0.8118 | 0.7789 | 0.9555   |
-| 0.0164        | 10.0  | 4500 | 0.2049          | 0.7558    | 0.8200 | 0.7866 | 0.9568   |
 ### Framework versions

 ---
+license: mit
+base_model: FacebookAI/xlm-roberta-large
 tags:
 - generated_from_trainer
 datasets:
     metrics:
     - name: Precision
       type: precision
+      value: 0.8214447978191731
     - name: Recall
       type: recall
+      value: 0.8725868725868726
     - name: F1
       type: f1
+      value: 0.8462438567750995
     - name: Accuracy
       type: accuracy
+      value: 0.9689700130378096
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # CNEC2_0_Supertypes_xlm-roberta-large
+This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on the cnec dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1759
+- Precision: 0.8214
+- Recall: 0.8726
+- F1: 0.8462
+- Accuracy: 0.9690
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_ratio: 0.1
 - lr_scheduler_warmup_steps: 1000
 - num_epochs: 10
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.9224        | 0.56  | 500  | 0.2309          | 0.5594    | 0.6863 | 0.6164 | 0.9358   |
+| 0.2449        | 1.11  | 1000 | 0.1960          | 0.6745    | 0.8142 | 0.7378 | 0.9525   |
+| 0.204         | 1.67  | 1500 | 0.1701          | 0.7256    | 0.8079 | 0.7646 | 0.9571   |
+| 0.1694        | 2.22  | 2000 | 0.1526          | 0.7605    | 0.8567 | 0.8057 | 0.9640   |
+| 0.1392        | 2.78  | 2500 | 0.1607          | 0.7697    | 0.8485 | 0.8072 | 0.9620   |
+| 0.1191        | 3.33  | 3000 | 0.1528          | 0.7969    | 0.8596 | 0.8270 | 0.9646   |
+| 0.1128        | 3.89  | 3500 | 0.1552          | 0.7668    | 0.8711 | 0.8156 | 0.9610   |
+| 0.095         | 4.44  | 4000 | 0.1678          | 0.7658    | 0.8615 | 0.8108 | 0.9632   |
+| 0.0979        | 5.0   | 4500 | 0.1432          | 0.8079    | 0.8625 | 0.8343 | 0.9672   |
+| 0.0764        | 5.56  | 5000 | 0.1548          | 0.8098    | 0.8528 | 0.8307 | 0.9671   |
+| 0.0829        | 6.11  | 5500 | 0.1423          | 0.8128    | 0.8653 | 0.8382 | 0.9672   |
+| 0.0648        | 6.67  | 6000 | 0.1548          | 0.8038    | 0.8760 | 0.8383 | 0.9673   |
+| 0.0529        | 7.22  | 6500 | 0.1653          | 0.8139    | 0.8716 | 0.8418 | 0.9675   |
+| 0.0483        | 7.78  | 7000 | 0.1630          | 0.8186    | 0.8649 | 0.8411 | 0.9680   |
+| 0.0494        | 8.33  | 7500 | 0.1709          | 0.8233    | 0.8682 | 0.8452 | 0.9686   |
+| 0.0389        | 8.89  | 8000 | 0.1757          | 0.8211    | 0.8726 | 0.8460 | 0.9687   |
+| 0.0356        | 9.44  | 8500 | 0.1740          | 0.8242    | 0.8736 | 0.8482 | 0.9692   |
+| 0.0337        | 10.0  | 9000 | 0.1759          | 0.8214    | 0.8726 | 0.8462 | 0.9690   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3f8e2a42c86f93502010626d1f2edc53becd89dc0824cd8295bd34fe6cf2722b
 size 2235481556

 version https://git-lfs.github.com/spec/v1
+oid sha256:26f3437a686a3428d5adec52a846c64dd892fb8244cdb104d06ee74779aa5c8c
 size 2235481556