jpodivin commited on
Commit
4f43b44
·
verified ·
1 Parent(s): c37ec24

End of training

Browse files
Files changed (2) hide show
  1. README.md +22 -52
  2. model.safetensors +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 4.9421
19
 
20
  ## Model description
21
 
@@ -40,62 +40,32 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 50
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | No log | 1.0 | 9 | 5.7606 |
50
- | No log | 2.0 | 18 | 5.2134 |
51
- | No log | 3.0 | 27 | 4.6264 |
52
- | No log | 4.0 | 36 | 4.6808 |
53
- | No log | 5.0 | 45 | 4.6072 |
54
- | No log | 6.0 | 54 | 4.0322 |
55
- | No log | 7.0 | 63 | 4.5964 |
56
- | No log | 8.0 | 72 | 3.3412 |
57
- | No log | 9.0 | 81 | 3.3570 |
58
- | No log | 10.0 | 90 | 3.6391 |
59
- | No log | 11.0 | 99 | 3.4018 |
60
- | No log | 12.0 | 108 | 3.4077 |
61
- | No log | 13.0 | 117 | 3.6908 |
62
- | No log | 14.0 | 126 | 4.0146 |
63
- | No log | 15.0 | 135 | 3.8527 |
64
- | No log | 16.0 | 144 | 3.9721 |
65
- | No log | 17.0 | 153 | 3.9417 |
66
- | No log | 18.0 | 162 | 3.8664 |
67
- | No log | 19.0 | 171 | 3.9009 |
68
- | No log | 20.0 | 180 | 3.9753 |
69
- | No log | 21.0 | 189 | 4.0617 |
70
- | No log | 22.0 | 198 | 4.1928 |
71
- | No log | 23.0 | 207 | 4.2910 |
72
- | No log | 24.0 | 216 | 4.2968 |
73
- | No log | 25.0 | 225 | 4.5358 |
74
- | No log | 26.0 | 234 | 4.3997 |
75
- | No log | 27.0 | 243 | 4.3952 |
76
- | No log | 28.0 | 252 | 4.5261 |
77
- | No log | 29.0 | 261 | 4.4806 |
78
- | No log | 30.0 | 270 | 4.5944 |
79
- | No log | 31.0 | 279 | 4.2314 |
80
- | No log | 32.0 | 288 | 4.5624 |
81
- | No log | 33.0 | 297 | 4.6074 |
82
- | No log | 34.0 | 306 | 4.2452 |
83
- | No log | 35.0 | 315 | 4.6662 |
84
- | No log | 36.0 | 324 | 4.2687 |
85
- | No log | 37.0 | 333 | 4.7763 |
86
- | No log | 38.0 | 342 | 4.7474 |
87
- | No log | 39.0 | 351 | 4.7573 |
88
- | No log | 40.0 | 360 | 4.7578 |
89
- | No log | 41.0 | 369 | 4.9119 |
90
- | No log | 42.0 | 378 | 4.9752 |
91
- | No log | 43.0 | 387 | 4.9227 |
92
- | No log | 44.0 | 396 | 4.8575 |
93
- | No log | 45.0 | 405 | 4.8022 |
94
- | No log | 46.0 | 414 | 4.9180 |
95
- | No log | 47.0 | 423 | 4.9006 |
96
- | No log | 48.0 | 432 | 4.9202 |
97
- | No log | 49.0 | 441 | 4.9380 |
98
- | No log | 50.0 | 450 | 4.9421 |
99
 
100
 
101
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 2.0857
19
 
20
  ## Model description
21
 
 
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 20
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | No log | 1.0 | 9 | 5.4568 |
50
+ | No log | 2.0 | 18 | 4.7897 |
51
+ | No log | 3.0 | 27 | 4.6445 |
52
+ | No log | 4.0 | 36 | 3.9367 |
53
+ | No log | 5.0 | 45 | 3.4457 |
54
+ | No log | 6.0 | 54 | 3.3149 |
55
+ | No log | 7.0 | 63 | 2.6427 |
56
+ | No log | 8.0 | 72 | 2.6698 |
57
+ | No log | 9.0 | 81 | 2.2418 |
58
+ | No log | 10.0 | 90 | 2.3653 |
59
+ | No log | 11.0 | 99 | 2.1887 |
60
+ | No log | 12.0 | 108 | 2.1629 |
61
+ | No log | 13.0 | 117 | 2.2699 |
62
+ | No log | 14.0 | 126 | 2.1080 |
63
+ | No log | 15.0 | 135 | 2.1836 |
64
+ | No log | 16.0 | 144 | 2.0967 |
65
+ | No log | 17.0 | 153 | 2.1418 |
66
+ | No log | 18.0 | 162 | 2.0863 |
67
+ | No log | 19.0 | 171 | 2.0778 |
68
+ | No log | 20.0 | 180 | 2.0857 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
69
 
70
 
71
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c43bca7e7979a895deb81dadc61d7f893d5b95dd6683b219491a1acc974ffb70
3
  size 260782152
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:15112075ccd15586c477dca41b2a9b0e170ef69ee4a890a425b94e47abc8a71e
3
  size 260782152