stulcrad
/

CNEC1_1_extended_xlm-roberta-large

@@ -25,16 +25,16 @@ model-index:
     metrics:
     - name: Precision
       type: precision
-      value: 0.8641078838174274
     - name: Recall
       type: recall
-      value: 0.8904329235702833
     - name: F1
       type: f1
-      value: 0.877072913924717
     - name: Accuracy
       type: accuracy
-      value: 0.970074812967581
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -44,11 +44,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on the cnec dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1937
-- Precision: 0.8641
-- Recall: 0.8904
-- F1: 0.8771
-- Accuracy: 0.9701
 ## Model description
@@ -68,27 +68,22 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 4
-- eval_batch_size: 4
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Precision | Recall | F1     | Accuracy |
-|:-------------:|:-----:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.3037        | 1.0   | 1161  | 0.1876          | 0.7785    | 0.8113 | 0.7946 | 0.9529   |
-| 0.2331        | 2.0   | 2322  | 0.2008          | 0.8065    | 0.8263 | 0.8163 | 0.9569   |
-| 0.1828        | 3.0   | 3483  | 0.1656          | 0.8332    | 0.8648 | 0.8487 | 0.9648   |
-| 0.1456        | 4.0   | 4644  | 0.1659          | 0.8414    | 0.8675 | 0.8542 | 0.9643   |
-| 0.1237        | 5.0   | 5805  | 0.1746          | 0.8538    | 0.8899 | 0.8715 | 0.9690   |
-| 0.1074        | 6.0   | 6966  | 0.1782          | 0.8584    | 0.8878 | 0.8728 | 0.9691   |
-| 0.097         | 7.0   | 8127  | 0.1802          | 0.8517    | 0.8840 | 0.8676 | 0.9691   |
-| 0.072         | 8.0   | 9288  | 0.1908          | 0.8636    | 0.8867 | 0.875  | 0.9703   |
-| 0.067         | 9.0   | 10449 | 0.1962          | 0.8672    | 0.8936 | 0.8802 | 0.9711   |
-| 0.0636        | 10.0  | 11610 | 0.1937          | 0.8641    | 0.8904 | 0.8771 | 0.9701   |
 ### Framework versions

     metrics:
     - name: Precision
       type: precision
+      value: 0.8533541341653667
     - name: Recall
       type: recall
+      value: 0.8770710849812934
     - name: F1
       type: f1
+      value: 0.8650500790722193
     - name: Accuracy
       type: accuracy
+      value: 0.9670664608320468
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on the cnec dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1498
+- Precision: 0.8534
+- Recall: 0.8771
+- F1: 0.8651
+- Accuracy: 0.9671
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.3961        | 1.0   | 581  | 0.1800          | 0.8004    | 0.8231 | 0.8116 | 0.9560   |
+| 0.1772        | 2.0   | 1162 | 0.1518          | 0.8357    | 0.8648 | 0.8500 | 0.9642   |
+| 0.1266        | 3.0   | 1743 | 0.1545          | 0.8377    | 0.8717 | 0.8544 | 0.9680   |
+| 0.1043        | 4.0   | 2324 | 0.1472          | 0.8473    | 0.8691 | 0.8580 | 0.9656   |
+| 0.0804        | 5.0   | 2905 | 0.1498          | 0.8534    | 0.8771 | 0.8651 | 0.9671   |
 ### Framework versions

config.json CHANGED Viewed

@@ -8,7 +8,7 @@
   "classifier_dropout": null,
   "eos_token_id": 2,
   "hidden_act": "gelu",
-  "hidden_dropout_prob": 0.3,
   "hidden_size": 1024,
   "id2label": {
     "0": "O",

   "classifier_dropout": null,
   "eos_token_id": 2,
   "hidden_act": "gelu",
+  "hidden_dropout_prob": 0.2,
   "hidden_size": 1024,
   "id2label": {
     "0": "O",

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5edeaf9e0b1f00c8ba12f0bd348c0ada9eb21d1a4b87f8b5afbbc53496bd6893
 size 2235473356

 version https://git-lfs.github.com/spec/v1
+oid sha256:df46b2750ed74f176adba5d7e288d0b10360c90a4b06fd1c50ebf2e7026c1ea5
 size 2235473356

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:88026046f15f5dcd1a7c63f9d988e107838d32a787b3c6ba70a65b6440d455c1
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:df1edf8fec4061938d4b10761bf8d9ecaf1a667dd1c56f2044b0941cb214f45b
 size 4728