blabla

by jpodivin - opened Jan 26

base: refs/heads/main

←

from: refs/pr/1

Discussion Files changed

+57826

-37979

This PR is in draft mode

Files changed (12) hide show

README.md +12 -22
model.safetensors +1 -1
runs/Jan29_10-41-31_ac36066877f7/events.out.tfevents.1706524892.ac36066877f7.188.0 +0 -3
runs/Jan29_11-11-11_ac36066877f7/events.out.tfevents.1706526672.ac36066877f7.188.1 +0 -3
runs/Jan29_11-26-30_ac36066877f7/events.out.tfevents.1706527591.ac36066877f7.188.2 +0 -3
runs/Jan29_11-26-52_ac36066877f7/events.out.tfevents.1706527613.ac36066877f7.188.3 +0 -3
runs/Jan29_12-30-31_fa2c09ae072b/events.out.tfevents.1706531432.fa2c09ae072b.182.0 +0 -3
special_tokens_map.json +5 -35
tokenizer.json +0 -0
tokenizer_config.json +4 -4
training_args.bin +1 -1
vocab.txt +0 -0

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0857
 ## Model description
@@ -40,32 +40,22 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 20
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| No log        | 1.0   | 9    | 5.4568          |
-| No log        | 2.0   | 18   | 4.7897          |
-| No log        | 3.0   | 27   | 4.6445          |
-| No log        | 4.0   | 36   | 3.9367          |
-| No log        | 5.0   | 45   | 3.4457          |
-| No log        | 6.0   | 54   | 3.3149          |
-| No log        | 7.0   | 63   | 2.6427          |
-| No log        | 8.0   | 72   | 2.6698          |
-| No log        | 9.0   | 81   | 2.2418          |
-| No log        | 10.0  | 90   | 2.3653          |
-| No log        | 11.0  | 99   | 2.1887          |
-| No log        | 12.0  | 108  | 2.1629          |
-| No log        | 13.0  | 117  | 2.2699          |
-| No log        | 14.0  | 126  | 2.1080          |
-| No log        | 15.0  | 135  | 2.1836          |
-| No log        | 16.0  | 144  | 2.0967          |
-| No log        | 17.0  | 153  | 2.1418          |
-| No log        | 18.0  | 162  | 2.0863          |
-| No log        | 19.0  | 171  | 2.0778          |
-| No log        | 20.0  | 180  | 2.0857          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4325
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 27   | 0.5354          |
+| No log        | 2.0   | 54   | 0.5203          |
+| No log        | 3.0   | 81   | 0.4536          |
+| No log        | 4.0   | 108  | 0.4493          |
+| No log        | 5.0   | 135  | 0.3256          |
+| No log        | 6.0   | 162  | 0.4192          |
+| No log        | 7.0   | 189  | 0.5078          |
+| No log        | 8.0   | 216  | 0.4952          |
+| No log        | 9.0   | 243  | 0.4654          |
+| No log        | 10.0  | 270  | 0.4325          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:15112075ccd15586c477dca41b2a9b0e170ef69ee4a890a425b94e47abc8a71e
 size 260782152

 version https://git-lfs.github.com/spec/v1
+oid sha256:eaf943348a2d4817cc5bd23d1c4351fbb4bf7553c9b96707e00b963e265ce53f
 size 260782152

runs/Jan29_10-41-31_ac36066877f7/events.out.tfevents.1706524892.ac36066877f7.188.0 DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:50c5acb0d284927977e645c1ee996d4f473f4eea50cb4d2b5a224ec6d1aeda8e
-size 7126

runs/Jan29_11-11-11_ac36066877f7/events.out.tfevents.1706526672.ac36066877f7.188.1 DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:664f10c41194d8003cf8dd4bc5c5b7d3b21bd93b60f76a506e6e210fcab14ee6
-size 7105

runs/Jan29_11-26-30_ac36066877f7/events.out.tfevents.1706527591.ac36066877f7.188.2 DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:910fed5ffeaa1e0e361f262746e6f6e75a23513bd6e5e55eab9bd171ba1f7822
-size 4363

runs/Jan29_11-26-52_ac36066877f7/events.out.tfevents.1706527613.ac36066877f7.188.3 DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5ef6341bf23265c11166206d08113130286a58427f4bf560422add4f83add726
-size 17931

runs/Jan29_12-30-31_fa2c09ae072b/events.out.tfevents.1706531432.fa2c09ae072b.182.0 DELETED Viewed

@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:3e24c17b22641e6f21099fe78c17712501e8c756faabe599776d88b099bee2a6
-size 9801

special_tokens_map.json CHANGED Viewed

@@ -1,37 +1,7 @@
 {
-  "cls_token": {
-    "content": "[CLS]",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  },
-  "mask_token": {
-    "content": "[MASK]",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  },
-  "pad_token": {
-    "content": "[PAD]",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  },
-  "sep_token": {
-    "content": "[SEP]",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  },
-  "unk_token": {
-    "content": "[UNK]",
-    "lstrip": false,
-    "normalized": false,
-    "rstrip": false,
-    "single_word": false
-  }
 }

 {
+  "cls_token": "[CLS]",
+  "mask_token": "[MASK]",
+  "pad_token": "[PAD]",
+  "sep_token": "[SEP]",
+  "unk_token": "[UNK]"
 }

tokenizer.json CHANGED Viewed

The diff for this file is too large to render. See raw diff

tokenizer_config.json CHANGED Viewed

@@ -8,7 +8,7 @@
       "single_word": false,
       "special": true
     },
-    "1": {
       "content": "[UNK]",
       "lstrip": false,
       "normalized": false,
@@ -16,7 +16,7 @@
       "single_word": false,
       "special": true
     },
-    "2": {
       "content": "[CLS]",
       "lstrip": false,
       "normalized": false,
@@ -24,7 +24,7 @@
       "single_word": false,
       "special": true
     },
-    "3": {
       "content": "[SEP]",
       "lstrip": false,
       "normalized": false,
@@ -32,7 +32,7 @@
       "single_word": false,
       "special": true
     },
-    "4": {
       "content": "[MASK]",
       "lstrip": false,
       "normalized": false,

       "single_word": false,
       "special": true
     },
+    "100": {
       "content": "[UNK]",
       "lstrip": false,
       "normalized": false,
       "single_word": false,
       "special": true
     },
+    "101": {
       "content": "[CLS]",
       "lstrip": false,
       "normalized": false,
       "single_word": false,
       "special": true
     },
+    "102": {
       "content": "[SEP]",
       "lstrip": false,
       "normalized": false,
       "single_word": false,
       "special": true
     },
+    "103": {
       "content": "[MASK]",
       "lstrip": false,
       "normalized": false,

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c019675e8bc0e3d87d85ac181b173adf6a80b25da4b42a6b45c055501fda6bc7
 size 4600

 version https://git-lfs.github.com/spec/v1
+oid sha256:ce2970cbf7d43688f9538f74ce307b4b2debbb6c46c65cc04f1a78187e04062e
 size 4600

vocab.txt CHANGED Viewed

The diff for this file is too large to render. See raw diff