SoDehghan commited on
Commit
f400f94
·
verified ·
1 Parent(s): 1985b7f

Model save

Browse files
Files changed (3) hide show
  1. README.md +200 -3
  2. model.safetensors +1 -1
  3. tokenizer.json +1 -6
README.md CHANGED
@@ -1,3 +1,200 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ library_name: transformers
3
+ license: mit
4
+ base_model: dbmdz/bert-base-turkish-uncased
5
+ tags:
6
+ - generated_from_trainer
7
+ metrics:
8
+ - f1
9
+ - accuracy
10
+ model-index:
11
+ - name: test-push-to-hf
12
+ results: []
13
+ ---
14
+
15
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
16
+ should probably proofread and complete it, then remove this comment. -->
17
+
18
+ # test-push-to-hf
19
+
20
+ This model is a fine-tuned version of [dbmdz/bert-base-turkish-uncased](https://huggingface.co/dbmdz/bert-base-turkish-uncased) on the None dataset.
21
+ It achieves the following results on the evaluation set:
22
+ - Loss: 0.4143
23
+ - F1: 0.7534
24
+ - Roc Auc: 0.8185
25
+ - Accuracy: 0.6175
26
+
27
+ ## Model description
28
+
29
+ More information needed
30
+
31
+ ## Intended uses & limitations
32
+
33
+ More information needed
34
+
35
+ ## Training and evaluation data
36
+
37
+ More information needed
38
+
39
+ ## Training procedure
40
+
41
+ ### Training hyperparameters
42
+
43
+ The following hyperparameters were used during training:
44
+ - learning_rate: 5e-06
45
+ - train_batch_size: 8
46
+ - eval_batch_size: 20
47
+ - seed: 42
48
+ - optimizer: Use adamw_torch with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
49
+ - lr_scheduler_type: linear
50
+ - num_epochs: 10
51
+
52
+ ### Training results
53
+
54
+ | Training Loss | Epoch | Step | Validation Loss | F1 | Roc Auc | Accuracy |
55
+ |:-------------:|:------:|:-----:|:---------------:|:------:|:-------:|:--------:|
56
+ | 0.5693 | 0.0729 | 100 | 0.5335 | 0.5484 | 0.6764 | 0.3880 |
57
+ | 0.5283 | 0.1458 | 200 | 0.5057 | 0.5802 | 0.6968 | 0.4281 |
58
+ | 0.4997 | 0.2187 | 300 | 0.4800 | 0.5915 | 0.7047 | 0.4372 |
59
+ | 0.4613 | 0.2915 | 400 | 0.4601 | 0.6105 | 0.7181 | 0.4299 |
60
+ | 0.4431 | 0.3644 | 500 | 0.4189 | 0.6634 | 0.7518 | 0.4982 |
61
+ | 0.4276 | 0.4373 | 600 | 0.3913 | 0.7056 | 0.7797 | 0.5401 |
62
+ | 0.4087 | 0.5102 | 700 | 0.3834 | 0.7036 | 0.7790 | 0.5383 |
63
+ | 0.3971 | 0.5831 | 800 | 0.3754 | 0.7060 | 0.7791 | 0.5537 |
64
+ | 0.4046 | 0.6560 | 900 | 0.3692 | 0.7082 | 0.7816 | 0.5510 |
65
+ | 0.3835 | 0.7289 | 1000 | 0.3705 | 0.7186 | 0.7901 | 0.5765 |
66
+ | 0.3657 | 0.8017 | 1100 | 0.3620 | 0.7260 | 0.7944 | 0.5838 |
67
+ | 0.3838 | 0.8746 | 1200 | 0.3574 | 0.7335 | 0.8005 | 0.5811 |
68
+ | 0.3827 | 0.9475 | 1300 | 0.3486 | 0.7361 | 0.8025 | 0.5883 |
69
+ | 0.3578 | 1.0204 | 1400 | 0.3483 | 0.7415 | 0.8065 | 0.5984 |
70
+ | 0.335 | 1.0933 | 1500 | 0.3436 | 0.7387 | 0.8040 | 0.6066 |
71
+ | 0.3417 | 1.1662 | 1600 | 0.3413 | 0.7453 | 0.8094 | 0.6093 |
72
+ | 0.3465 | 1.2391 | 1700 | 0.3388 | 0.7356 | 0.8027 | 0.6075 |
73
+ | 0.3417 | 1.3120 | 1800 | 0.3361 | 0.7414 | 0.8068 | 0.6047 |
74
+ | 0.3283 | 1.3848 | 1900 | 0.3343 | 0.7491 | 0.8118 | 0.6120 |
75
+ | 0.3132 | 1.4577 | 2000 | 0.3422 | 0.7466 | 0.8122 | 0.5974 |
76
+ | 0.3317 | 1.5306 | 2100 | 0.3377 | 0.7432 | 0.8080 | 0.6066 |
77
+ | 0.3152 | 1.6035 | 2200 | 0.3345 | 0.7422 | 0.8065 | 0.6111 |
78
+ | 0.3351 | 1.6764 | 2300 | 0.3304 | 0.7451 | 0.8090 | 0.6093 |
79
+ | 0.3269 | 1.7493 | 2400 | 0.3282 | 0.7440 | 0.8069 | 0.6157 |
80
+ | 0.3247 | 1.8222 | 2500 | 0.3257 | 0.7505 | 0.8127 | 0.6129 |
81
+ | 0.3241 | 1.8950 | 2600 | 0.3218 | 0.7578 | 0.8181 | 0.6239 |
82
+ | 0.3135 | 1.9679 | 2700 | 0.3260 | 0.7430 | 0.8070 | 0.5956 |
83
+ | 0.3125 | 2.0408 | 2800 | 0.3212 | 0.7590 | 0.8193 | 0.6275 |
84
+ | 0.2995 | 2.1137 | 2900 | 0.3237 | 0.7510 | 0.8144 | 0.6257 |
85
+ | 0.3031 | 2.1866 | 3000 | 0.3239 | 0.7494 | 0.8115 | 0.6193 |
86
+ | 0.2772 | 2.2595 | 3100 | 0.3229 | 0.7463 | 0.8101 | 0.6157 |
87
+ | 0.3043 | 2.3324 | 3200 | 0.3279 | 0.7429 | 0.8068 | 0.6056 |
88
+ | 0.298 | 2.4052 | 3300 | 0.3258 | 0.7489 | 0.8122 | 0.6075 |
89
+ | 0.2933 | 2.4781 | 3400 | 0.3318 | 0.7486 | 0.8123 | 0.6248 |
90
+ | 0.2723 | 2.5510 | 3500 | 0.3326 | 0.7566 | 0.8188 | 0.6357 |
91
+ | 0.2957 | 2.6239 | 3600 | 0.3271 | 0.7566 | 0.8195 | 0.6339 |
92
+ | 0.2715 | 2.6968 | 3700 | 0.3366 | 0.7568 | 0.8197 | 0.6166 |
93
+ | 0.2825 | 2.7697 | 3800 | 0.3254 | 0.7512 | 0.8138 | 0.6166 |
94
+ | 0.2844 | 2.8426 | 3900 | 0.3268 | 0.7527 | 0.8151 | 0.6138 |
95
+ | 0.2721 | 2.9155 | 4000 | 0.3277 | 0.7595 | 0.8214 | 0.6230 |
96
+ | 0.288 | 2.9883 | 4100 | 0.3262 | 0.7551 | 0.8179 | 0.6211 |
97
+ | 0.2615 | 3.0612 | 4200 | 0.3545 | 0.7578 | 0.8227 | 0.6066 |
98
+ | 0.2614 | 3.1341 | 4300 | 0.3308 | 0.7530 | 0.8163 | 0.6311 |
99
+ | 0.2624 | 3.2070 | 4400 | 0.3316 | 0.7626 | 0.8235 | 0.6302 |
100
+ | 0.2458 | 3.2799 | 4500 | 0.3472 | 0.7556 | 0.8193 | 0.6202 |
101
+ | 0.2707 | 3.3528 | 4600 | 0.3347 | 0.7568 | 0.8195 | 0.6257 |
102
+ | 0.2557 | 3.4257 | 4700 | 0.3339 | 0.7535 | 0.8169 | 0.6211 |
103
+ | 0.2583 | 3.4985 | 4800 | 0.3348 | 0.7564 | 0.8198 | 0.6211 |
104
+ | 0.2716 | 3.5714 | 4900 | 0.3405 | 0.7509 | 0.8165 | 0.6084 |
105
+ | 0.258 | 3.6443 | 5000 | 0.3366 | 0.7606 | 0.8236 | 0.6193 |
106
+ | 0.2689 | 3.7172 | 5100 | 0.3505 | 0.7524 | 0.8155 | 0.6120 |
107
+ | 0.271 | 3.7901 | 5200 | 0.3331 | 0.7577 | 0.8199 | 0.6129 |
108
+ | 0.2534 | 3.8630 | 5300 | 0.3325 | 0.7526 | 0.8162 | 0.6275 |
109
+ | 0.2637 | 3.9359 | 5400 | 0.3324 | 0.7514 | 0.8147 | 0.6248 |
110
+ | 0.2556 | 4.0087 | 5500 | 0.3419 | 0.7557 | 0.8188 | 0.6111 |
111
+ | 0.243 | 4.0816 | 5600 | 0.3354 | 0.7540 | 0.8173 | 0.6230 |
112
+ | 0.2282 | 4.1545 | 5700 | 0.3547 | 0.7553 | 0.8215 | 0.6138 |
113
+ | 0.2229 | 4.2274 | 5800 | 0.3414 | 0.7604 | 0.8229 | 0.6412 |
114
+ | 0.2441 | 4.3003 | 5900 | 0.3454 | 0.7594 | 0.8222 | 0.6230 |
115
+ | 0.2313 | 4.3732 | 6000 | 0.3428 | 0.7558 | 0.8203 | 0.6093 |
116
+ | 0.2277 | 4.4461 | 6100 | 0.3441 | 0.7523 | 0.8165 | 0.6129 |
117
+ | 0.2667 | 4.5190 | 6200 | 0.3456 | 0.7515 | 0.8157 | 0.6257 |
118
+ | 0.2339 | 4.5918 | 6300 | 0.3468 | 0.7573 | 0.8185 | 0.6430 |
119
+ | 0.2356 | 4.6647 | 6400 | 0.3554 | 0.7551 | 0.8185 | 0.6275 |
120
+ | 0.2288 | 4.7376 | 6500 | 0.3499 | 0.7561 | 0.8189 | 0.6330 |
121
+ | 0.2165 | 4.8105 | 6600 | 0.3767 | 0.7491 | 0.8137 | 0.6102 |
122
+ | 0.2318 | 4.8834 | 6700 | 0.3530 | 0.7571 | 0.8204 | 0.6366 |
123
+ | 0.2265 | 4.9563 | 6800 | 0.3534 | 0.7593 | 0.8225 | 0.6257 |
124
+ | 0.2062 | 5.0292 | 6900 | 0.3588 | 0.7522 | 0.8168 | 0.6129 |
125
+ | 0.211 | 5.1020 | 7000 | 0.3548 | 0.7497 | 0.8148 | 0.6184 |
126
+ | 0.2109 | 5.1749 | 7100 | 0.3522 | 0.7509 | 0.8152 | 0.6266 |
127
+ | 0.1995 | 5.2478 | 7200 | 0.3673 | 0.7536 | 0.8182 | 0.6211 |
128
+ | 0.2274 | 5.3207 | 7300 | 0.3615 | 0.7539 | 0.8178 | 0.6230 |
129
+ | 0.2126 | 5.3936 | 7400 | 0.3610 | 0.7533 | 0.8172 | 0.6220 |
130
+ | 0.2086 | 5.4665 | 7500 | 0.3666 | 0.7513 | 0.8163 | 0.6066 |
131
+ | 0.2129 | 5.5394 | 7600 | 0.3582 | 0.7506 | 0.8155 | 0.6202 |
132
+ | 0.2243 | 5.6122 | 7700 | 0.3576 | 0.7495 | 0.8143 | 0.6202 |
133
+ | 0.2028 | 5.6851 | 7800 | 0.3649 | 0.7552 | 0.8192 | 0.6284 |
134
+ | 0.2233 | 5.7580 | 7900 | 0.3618 | 0.7573 | 0.8216 | 0.6275 |
135
+ | 0.2273 | 5.8309 | 8000 | 0.3632 | 0.7570 | 0.8207 | 0.6193 |
136
+ | 0.2045 | 5.9038 | 8100 | 0.3621 | 0.7578 | 0.8201 | 0.6357 |
137
+ | 0.209 | 5.9767 | 8200 | 0.3690 | 0.7578 | 0.8204 | 0.6311 |
138
+ | 0.2051 | 6.0496 | 8300 | 0.3695 | 0.7583 | 0.8210 | 0.6211 |
139
+ | 0.1826 | 6.1224 | 8400 | 0.3726 | 0.7556 | 0.8191 | 0.6321 |
140
+ | 0.1935 | 6.1953 | 8500 | 0.3762 | 0.7522 | 0.8176 | 0.6129 |
141
+ | 0.18 | 6.2682 | 8600 | 0.3743 | 0.7557 | 0.8198 | 0.6257 |
142
+ | 0.1815 | 6.3411 | 8700 | 0.3769 | 0.7535 | 0.8192 | 0.6102 |
143
+ | 0.2045 | 6.4140 | 8800 | 0.3762 | 0.7513 | 0.8166 | 0.6111 |
144
+ | 0.1792 | 6.4869 | 8900 | 0.3753 | 0.7559 | 0.8197 | 0.6202 |
145
+ | 0.1813 | 6.5598 | 9000 | 0.3780 | 0.7550 | 0.8190 | 0.6211 |
146
+ | 0.1962 | 6.6327 | 9100 | 0.3814 | 0.7551 | 0.8190 | 0.6211 |
147
+ | 0.2049 | 6.7055 | 9200 | 0.3855 | 0.7607 | 0.8233 | 0.6257 |
148
+ | 0.1992 | 6.7784 | 9300 | 0.3861 | 0.7518 | 0.8167 | 0.6193 |
149
+ | 0.1767 | 6.8513 | 9400 | 0.3844 | 0.7594 | 0.8230 | 0.6220 |
150
+ | 0.2234 | 6.9242 | 9500 | 0.3818 | 0.7537 | 0.8181 | 0.6157 |
151
+ | 0.188 | 6.9971 | 9600 | 0.3863 | 0.7562 | 0.8202 | 0.6157 |
152
+ | 0.1694 | 7.0700 | 9700 | 0.3864 | 0.7493 | 0.8154 | 0.6148 |
153
+ | 0.174 | 7.1429 | 9800 | 0.3919 | 0.7559 | 0.8205 | 0.6193 |
154
+ | 0.1773 | 7.2157 | 9900 | 0.3936 | 0.7575 | 0.8213 | 0.6266 |
155
+ | 0.1808 | 7.2886 | 10000 | 0.4010 | 0.7621 | 0.8252 | 0.6275 |
156
+ | 0.1816 | 7.3615 | 10100 | 0.3946 | 0.7523 | 0.8178 | 0.6202 |
157
+ | 0.1866 | 7.4344 | 10200 | 0.3950 | 0.7508 | 0.8160 | 0.6220 |
158
+ | 0.1766 | 7.5073 | 10300 | 0.3991 | 0.7561 | 0.8194 | 0.6166 |
159
+ | 0.1855 | 7.5802 | 10400 | 0.3990 | 0.7527 | 0.8169 | 0.6275 |
160
+ | 0.1706 | 7.6531 | 10500 | 0.3980 | 0.7561 | 0.8202 | 0.6230 |
161
+ | 0.1798 | 7.7259 | 10600 | 0.3994 | 0.7552 | 0.8194 | 0.6166 |
162
+ | 0.1897 | 7.7988 | 10700 | 0.3956 | 0.7492 | 0.8147 | 0.6175 |
163
+ | 0.1735 | 7.8717 | 10800 | 0.3977 | 0.7523 | 0.8168 | 0.6202 |
164
+ | 0.1594 | 7.9446 | 10900 | 0.4004 | 0.7557 | 0.8198 | 0.6184 |
165
+ | 0.1683 | 8.0175 | 11000 | 0.4029 | 0.7480 | 0.8138 | 0.6157 |
166
+ | 0.154 | 8.0904 | 11100 | 0.4058 | 0.7565 | 0.8211 | 0.6157 |
167
+ | 0.1717 | 8.1633 | 11200 | 0.4086 | 0.7603 | 0.8232 | 0.6211 |
168
+ | 0.1572 | 8.2362 | 11300 | 0.4034 | 0.7547 | 0.8199 | 0.6157 |
169
+ | 0.1678 | 8.3090 | 11400 | 0.4055 | 0.7535 | 0.8190 | 0.6138 |
170
+ | 0.1804 | 8.3819 | 11500 | 0.4064 | 0.7459 | 0.8119 | 0.6157 |
171
+ | 0.1577 | 8.4548 | 11600 | 0.4067 | 0.7517 | 0.8170 | 0.6184 |
172
+ | 0.1718 | 8.5277 | 11700 | 0.4058 | 0.75 | 0.8155 | 0.6184 |
173
+ | 0.1577 | 8.6006 | 11800 | 0.4085 | 0.7519 | 0.8175 | 0.6166 |
174
+ | 0.1627 | 8.6735 | 11900 | 0.4084 | 0.7538 | 0.8189 | 0.6166 |
175
+ | 0.1602 | 8.7464 | 12000 | 0.4142 | 0.7502 | 0.8162 | 0.6157 |
176
+ | 0.1721 | 8.8192 | 12100 | 0.4095 | 0.7520 | 0.8172 | 0.6166 |
177
+ | 0.1791 | 8.8921 | 12200 | 0.4094 | 0.7528 | 0.8177 | 0.6166 |
178
+ | 0.1701 | 8.9650 | 12300 | 0.4150 | 0.7535 | 0.8182 | 0.6184 |
179
+ | 0.1711 | 9.0379 | 12400 | 0.4100 | 0.7539 | 0.8188 | 0.6157 |
180
+ | 0.1621 | 9.1108 | 12500 | 0.4102 | 0.7546 | 0.8189 | 0.6193 |
181
+ | 0.1467 | 9.1837 | 12600 | 0.4124 | 0.7532 | 0.8183 | 0.6138 |
182
+ | 0.1573 | 9.2566 | 12700 | 0.4137 | 0.7531 | 0.8183 | 0.6148 |
183
+ | 0.1684 | 9.3294 | 12800 | 0.4099 | 0.7537 | 0.8189 | 0.6184 |
184
+ | 0.1482 | 9.4023 | 12900 | 0.4118 | 0.7539 | 0.8188 | 0.6193 |
185
+ | 0.1693 | 9.4752 | 13000 | 0.4124 | 0.7527 | 0.8177 | 0.6193 |
186
+ | 0.1522 | 9.5481 | 13100 | 0.4128 | 0.7566 | 0.8205 | 0.6211 |
187
+ | 0.1387 | 9.6210 | 13200 | 0.4146 | 0.7529 | 0.8179 | 0.6157 |
188
+ | 0.1594 | 9.6939 | 13300 | 0.4140 | 0.7523 | 0.8176 | 0.6175 |
189
+ | 0.1699 | 9.7668 | 13400 | 0.4138 | 0.7530 | 0.8181 | 0.6157 |
190
+ | 0.1408 | 9.8397 | 13500 | 0.4136 | 0.7536 | 0.8187 | 0.6175 |
191
+ | 0.1634 | 9.9125 | 13600 | 0.4142 | 0.7541 | 0.8190 | 0.6184 |
192
+ | 0.1613 | 9.9854 | 13700 | 0.4143 | 0.7534 | 0.8185 | 0.6175 |
193
+
194
+
195
+ ### Framework versions
196
+
197
+ - Transformers 4.46.3
198
+ - Pytorch 2.5.1+cu121
199
+ - Datasets 3.1.0
200
+ - Tokenizers 0.20.3
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1f960d2a8139853a8283dbe7baf582905cd5a489e01d02a863f9c0e2ad205e87
3
  size 447979528
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1e877e45b91e3b8addda0d7a61b6d095e847d2dd5f1fdbbb378b6b8169431bda
3
  size 447979528
tokenizer.json CHANGED
@@ -1,11 +1,6 @@
1
  {
2
  "version": "1.0",
3
- "truncation": {
4
- "direction": "Right",
5
- "max_length": 512,
6
- "strategy": "LongestFirst",
7
- "stride": 0
8
- },
9
  "padding": null,
10
  "added_tokens": [
11
  {
 
1
  {
2
  "version": "1.0",
3
+ "truncation": null,
 
 
 
 
 
4
  "padding": null,
5
  "added_tokens": [
6
  {