End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [deepset/gelectra-large-germanquad](https://huggingface.co/deepset/gelectra-large-germanquad) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0316
 ## Model description
@@ -46,21 +46,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.0762        | 1.0   | 3    | 0.3330          |
-| 0.5877        | 2.0   | 6    | 0.2301          |
-| 0.0194        | 3.0   | 9    | 0.1833          |
-| 0.1341        | 4.0   | 12   | 0.0951          |
-| 0.002         | 5.0   | 15   | 0.0650          |
-| 0.1363        | 6.0   | 18   | 0.0458          |
-| 0.0263        | 7.0   | 21   | 0.0385          |
-| 0.0032        | 8.0   | 24   | 0.0349          |
-| 0.0036        | 9.0   | 27   | 0.0328          |
-| 0.0031        | 10.0  | 30   | 0.0316          |
 ### Framework versions
 - Transformers 4.38.2
-- Pytorch 2.1.0+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2

 This model is a fine-tuned version of [deepset/gelectra-large-germanquad](https://huggingface.co/deepset/gelectra-large-germanquad) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 5.3001
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.1726        | 1.0   | 3    | 4.1350          |
+| 1.4685        | 2.0   | 6    | 3.3928          |
+| 0.8648        | 3.0   | 9    | 3.8220          |
+| 0.0119        | 4.0   | 12   | 4.2166          |
+| 0.0093        | 5.0   | 15   | 4.5749          |
+| 0.0156        | 6.0   | 18   | 4.8764          |
+| 0.0225        | 7.0   | 21   | 5.0761          |
+| 0.0035        | 8.0   | 24   | 5.2095          |
+| 0.0057        | 9.0   | 27   | 5.2750          |
+| 0.025         | 10.0  | 30   | 5.3001          |
 ### Framework versions
 - Transformers 4.38.2
+- Pytorch 2.2.1+cu121
 - Datasets 2.18.0
 - Tokenizers 0.15.2

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:23e54d9988aa1586db2087ceedabe442dd01574719d192b9d02aa16a52b9a05e
 size 1338801016

 version https://git-lfs.github.com/spec/v1
+oid sha256:4510932ecb0308d77350910ef65e1a2491526d38d259c9653927ca7bfeb7da33
 size 1338801016

runs/Mar14_15-22-37_d3883cb7c15e/events.out.tfevents.1710429758.d3883cb7c15e.329.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:711f30f8c122c975bc7e03c3fa1090342bb332feee2a4c9a3cc6c3aa3f870dcf
+size 14011

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:312caaa79698495e113c8ef6987cc5ac5f906c25e2806cbf000514edc2097f99
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:698f2406dcadb1b6bb200a7cf34b99fe0681b202d655d42c77dbf94c07215f30
 size 4920