End of training

Browse files

Files changed (5) hide show

README.md +41 -41
model.safetensors +1 -1
runs/Mar04_14-32-11_7eccf1ba7969/events.out.tfevents.1709562732.7eccf1ba7969.331.5 +3 -0
tokenizer.json +18 -18
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0005
 ## Model description
@@ -44,46 +44,46 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.1076        | 1.0   | 14   | 1.5802          |
-| 1.2151        | 2.0   | 28   | 0.6870          |
-| 0.4194        | 3.0   | 42   | 0.3199          |
-| 0.2906        | 4.0   | 56   | 0.2531          |
-| 0.2292        | 5.0   | 70   | 0.1898          |
-| 0.1716        | 6.0   | 84   | 0.1377          |
-| 0.1194        | 7.0   | 98   | 0.0855          |
-| 0.0728        | 8.0   | 112  | 0.0450          |
-| 0.0414        | 9.0   | 126  | 0.0226          |
-| 0.023         | 10.0  | 140  | 0.0112          |
-| 0.0134        | 11.0  | 154  | 0.0065          |
-| 0.009         | 12.0  | 168  | 0.0045          |
-| 0.0064        | 13.0  | 182  | 0.0030          |
-| 0.0046        | 14.0  | 196  | 0.0023          |
-| 0.0036        | 15.0  | 210  | 0.0019          |
-| 0.0031        | 16.0  | 224  | 0.0017          |
-| 0.0029        | 17.0  | 238  | 0.0015          |
-| 0.0024        | 18.0  | 252  | 0.0013          |
-| 0.0022        | 19.0  | 266  | 0.0012          |
-| 0.0019        | 20.0  | 280  | 0.0011          |
-| 0.0019        | 21.0  | 294  | 0.0010          |
-| 0.0017        | 22.0  | 308  | 0.0009          |
-| 0.0016        | 23.0  | 322  | 0.0009          |
-| 0.0015        | 24.0  | 336  | 0.0008          |
-| 0.0014        | 25.0  | 350  | 0.0008          |
-| 0.0013        | 26.0  | 364  | 0.0007          |
-| 0.0012        | 27.0  | 378  | 0.0007          |
-| 0.0012        | 28.0  | 392  | 0.0007          |
-| 0.0011        | 29.0  | 406  | 0.0007          |
-| 0.0011        | 30.0  | 420  | 0.0006          |
-| 0.001         | 31.0  | 434  | 0.0006          |
-| 0.001         | 32.0  | 448  | 0.0006          |
-| 0.001         | 33.0  | 462  | 0.0006          |
-| 0.001         | 34.0  | 476  | 0.0006          |
-| 0.0009        | 35.0  | 490  | 0.0006          |
-| 0.0009        | 36.0  | 504  | 0.0006          |
-| 0.0009        | 37.0  | 518  | 0.0006          |
-| 0.0009        | 38.0  | 532  | 0.0005          |
-| 0.0009        | 39.0  | 546  | 0.0005          |
-| 0.0009        | 40.0  | 560  | 0.0005          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0011
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 2.3492        | 1.0   | 11   | 1.8446          |
+| 1.6854        | 2.0   | 22   | 1.3965          |
+| 1.1935        | 3.0   | 33   | 0.8348          |
+| 0.5028        | 4.0   | 44   | 0.3122          |
+| 0.2788        | 5.0   | 55   | 0.2400          |
+| 0.2212        | 6.0   | 66   | 0.1893          |
+| 0.1813        | 7.0   | 77   | 0.1544          |
+| 0.1515        | 8.0   | 88   | 0.1281          |
+| 0.1206        | 9.0   | 99   | 0.0903          |
+| 0.087         | 10.0  | 110  | 0.0571          |
+| 0.058         | 11.0  | 121  | 0.0359          |
+| 0.0378        | 12.0  | 132  | 0.0204          |
+| 0.0249        | 13.0  | 143  | 0.0126          |
+| 0.0169        | 14.0  | 154  | 0.0085          |
+| 0.0123        | 15.0  | 165  | 0.0061          |
+| 0.009         | 16.0  | 176  | 0.0047          |
+| 0.0073        | 17.0  | 187  | 0.0037          |
+| 0.006         | 18.0  | 198  | 0.0031          |
+| 0.0049        | 19.0  | 209  | 0.0024          |
+| 0.0043        | 20.0  | 220  | 0.0023          |
+| 0.0037        | 21.0  | 231  | 0.0020          |
+| 0.0033        | 22.0  | 242  | 0.0019          |
+| 0.0032        | 23.0  | 253  | 0.0018          |
+| 0.003         | 24.0  | 264  | 0.0016          |
+| 0.0025        | 25.0  | 275  | 0.0015          |
+| 0.0024        | 26.0  | 286  | 0.0014          |
+| 0.0023        | 27.0  | 297  | 0.0014          |
+| 0.0022        | 28.0  | 308  | 0.0014          |
+| 0.0021        | 29.0  | 319  | 0.0013          |
+| 0.002         | 30.0  | 330  | 0.0012          |
+| 0.0019        | 31.0  | 341  | 0.0012          |
+| 0.0018        | 32.0  | 352  | 0.0012          |
+| 0.0018        | 33.0  | 363  | 0.0012          |
+| 0.0017        | 34.0  | 374  | 0.0011          |
+| 0.0018        | 35.0  | 385  | 0.0011          |
+| 0.0018        | 36.0  | 396  | 0.0011          |
+| 0.0017        | 37.0  | 407  | 0.0011          |
+| 0.0016        | 38.0  | 418  | 0.0011          |
+| 0.0016        | 39.0  | 429  | 0.0011          |
+| 0.0015        | 40.0  | 440  | 0.0011          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e52afcaa03c3935fc55a3bb9730e2eede2dae99ef5797e03b9b947207f602f28
 size 31168616

 version https://git-lfs.github.com/spec/v1
+oid sha256:f847db2077e1bf95a31752547ed6c3b59b878863c3500d01b1c7dc5a67c66633
 size 31168616

runs/Mar04_14-32-11_7eccf1ba7969/events.out.tfevents.1709562732.7eccf1ba7969.331.5 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:56310d7d0b63143e15cccb8e61564e1a570a9e17589fd95495c9516c07639e1d
+size 28087

tokenizer.json CHANGED Viewed

@@ -112,40 +112,40 @@
       "7": 13,
       "8": 14,
       "9": 15,
-      "99": 16,
-      "10": 17,
-      "98": 18,
-      "11": 19,
-      "12": 20,
-      "97": 21,
       "96": 22,
       "13": 23,
-      "14": 24,
-      "95": 25,
-      "94": 26,
-      "15": 27,
-      "93": 28,
-      "16": 29,
       "17": 30,
       "92": 31,
       "91": 32,
       "18": 33
     },
     "merges": [
-      "9 9",
       "1 0",
-      "9 8",
       "1 1",
-      "1 2",
       "9 7",
       "9 6",
       "1 3",
-      "1 4",
       "9 5",
-      "9 4",
       "1 5",
-      "9 3",
       "1 6",
       "1 7",
       "9 2",
       "9 1",

       "7": 13,
       "8": 14,
       "9": 15,
+      "10": 16,
+      "99": 17,
+      "11": 18,
+      "97": 19,
+      "98": 20,
+      "12": 21,
       "96": 22,
       "13": 23,
+      "95": 24,
+      "14": 25,
+      "15": 26,
+      "94": 27,
+      "16": 28,
+      "93": 29,
       "17": 30,
       "92": 31,
       "91": 32,
       "18": 33
     },
     "merges": [
       "1 0",
+      "9 9",
       "1 1",
       "9 7",
+      "9 8",
+      "1 2",
       "9 6",
       "1 3",
       "9 5",
+      "1 4",
       "1 5",
+      "9 4",
       "1 6",
+      "9 3",
       "1 7",
       "9 2",
       "9 1",

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7f199d10fd57b3941617042ed1ca82d970f8737c4ac2a4f6a08d706db6e172bd
 size 5112

 version https://git-lfs.github.com/spec/v1
+oid sha256:47a43f6969838e8280337d2378b5a2216f1b80eceaa53237d630498642e92074
 size 5112