Model save
Browse files- README.md +237 -7
- model.safetensors +1 -1
README.md
CHANGED
@@ -3,6 +3,8 @@ license: cc-by-nc-4.0
|
|
3 |
base_model: facebook/mms-1b-all
|
4 |
tags:
|
5 |
- generated_from_trainer
|
|
|
|
|
6 |
model-index:
|
7 |
- name: COPAS-mms1ball-Nov28
|
8 |
results: []
|
@@ -15,13 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
|
|
15 |
|
16 |
This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on an unknown dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
-
-
|
19 |
-
-
|
20 |
-
- eval_runtime: 44.3244
|
21 |
-
- eval_samples_per_second: 3.768
|
22 |
-
- eval_steps_per_second: 0.948
|
23 |
-
- epoch: 45.4
|
24 |
-
- step: 22700
|
25 |
|
26 |
## Model description
|
27 |
|
@@ -50,6 +47,239 @@ The following hyperparameters were used during training:
|
|
50 |
- num_epochs: 100
|
51 |
- mixed_precision_training: Native AMP
|
52 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
53 |
### Framework versions
|
54 |
|
55 |
- Transformers 4.43.4
|
|
|
3 |
base_model: facebook/mms-1b-all
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
+
metrics:
|
7 |
+
- wer
|
8 |
model-index:
|
9 |
- name: COPAS-mms1ball-Nov28
|
10 |
results: []
|
|
|
17 |
|
18 |
This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
+
- Loss: 2.7926
|
21 |
+
- Wer: 0.9939
|
|
|
|
|
|
|
|
|
|
|
22 |
|
23 |
## Model description
|
24 |
|
|
|
47 |
- num_epochs: 100
|
48 |
- mixed_precision_training: Native AMP
|
49 |
|
50 |
+
### Training results
|
51 |
+
|
52 |
+
| Training Loss | Epoch | Step | Validation Loss | Wer |
|
53 |
+
|:-------------:|:-----:|:-----:|:---------------:|:------:|
|
54 |
+
| 41.8398 | 0.2 | 100 | 37.3207 | 1.0 |
|
55 |
+
| 35.3547 | 0.4 | 200 | 27.8741 | 1.0 |
|
56 |
+
| 24.7565 | 0.6 | 300 | 19.3606 | 1.0 |
|
57 |
+
| 17.5639 | 0.8 | 400 | 12.4668 | 1.0 |
|
58 |
+
| 11.4454 | 1.0 | 500 | 8.1410 | 1.0 |
|
59 |
+
| 7.2036 | 1.2 | 600 | 6.0696 | 1.0 |
|
60 |
+
| 5.8017 | 1.4 | 700 | 5.3140 | 1.0 |
|
61 |
+
| 5.1698 | 1.6 | 800 | 5.0560 | 1.0 |
|
62 |
+
| 5.0254 | 1.8 | 900 | 4.9179 | 1.0 |
|
63 |
+
| 4.905 | 2.0 | 1000 | 4.8243 | 1.0 |
|
64 |
+
| 4.7647 | 2.2 | 1100 | 4.7402 | 1.0 |
|
65 |
+
| 4.736 | 2.4 | 1200 | 4.6689 | 1.0 |
|
66 |
+
| 4.6916 | 2.6 | 1300 | 4.6021 | 1.0 |
|
67 |
+
| 4.5545 | 2.8 | 1400 | 4.5514 | 1.0 |
|
68 |
+
| 4.5882 | 3.0 | 1500 | 4.5018 | 1.0 |
|
69 |
+
| 4.5573 | 3.2 | 1600 | 4.4525 | 1.0 |
|
70 |
+
| 4.4508 | 3.4 | 1700 | 4.4071 | 1.0 |
|
71 |
+
| 4.369 | 3.6 | 1800 | 4.3695 | 1.0 |
|
72 |
+
| 4.3941 | 3.8 | 1900 | 4.3237 | 1.0 |
|
73 |
+
| 4.3409 | 4.0 | 2000 | 4.2806 | 1.0 |
|
74 |
+
| 4.3205 | 4.2 | 2100 | 4.2383 | 1.0 |
|
75 |
+
| 4.2341 | 4.4 | 2200 | 4.1957 | 1.0 |
|
76 |
+
| 4.225 | 4.6 | 2300 | 4.1566 | 1.0 |
|
77 |
+
| 4.2211 | 4.8 | 2400 | 4.1210 | 1.0 |
|
78 |
+
| 4.1923 | 5.0 | 2500 | 4.0818 | 1.0 |
|
79 |
+
| 4.1785 | 5.2 | 2600 | 4.0491 | 1.0 |
|
80 |
+
| 4.1258 | 5.4 | 2700 | 4.0162 | 1.0 |
|
81 |
+
| 4.058 | 5.6 | 2800 | 3.9873 | 1.0 |
|
82 |
+
| 4.0715 | 5.8 | 2900 | 3.9629 | 1.0 |
|
83 |
+
| 4.0675 | 6.0 | 3000 | 3.9299 | 1.0 |
|
84 |
+
| 4.0338 | 6.2 | 3100 | 3.9041 | 1.0 |
|
85 |
+
| 3.9778 | 6.4 | 3200 | 3.8747 | 1.0 |
|
86 |
+
| 3.9507 | 6.6 | 3300 | 3.8510 | 0.9998 |
|
87 |
+
| 3.9829 | 6.8 | 3400 | 3.8248 | 0.9998 |
|
88 |
+
| 3.9373 | 7.0 | 3500 | 3.8036 | 0.9998 |
|
89 |
+
| 3.9433 | 7.2 | 3600 | 3.7812 | 0.9998 |
|
90 |
+
| 3.8741 | 7.4 | 3700 | 3.7618 | 0.9996 |
|
91 |
+
| 3.9197 | 7.6 | 3800 | 3.7420 | 0.9996 |
|
92 |
+
| 3.8497 | 7.8 | 3900 | 3.7223 | 0.9996 |
|
93 |
+
| 3.8934 | 8.0 | 4000 | 3.7031 | 0.9996 |
|
94 |
+
| 3.8427 | 8.2 | 4100 | 3.6863 | 0.9998 |
|
95 |
+
| 3.7761 | 8.4 | 4200 | 3.6698 | 0.9992 |
|
96 |
+
| 3.8544 | 8.6 | 4300 | 3.6574 | 0.9992 |
|
97 |
+
| 3.7864 | 8.8 | 4400 | 3.6425 | 0.9987 |
|
98 |
+
| 3.7913 | 9.0 | 4500 | 3.6284 | 0.9985 |
|
99 |
+
| 3.8103 | 9.2 | 4600 | 3.6162 | 0.9983 |
|
100 |
+
| 3.813 | 9.4 | 4700 | 3.6125 | 0.9985 |
|
101 |
+
| 3.7552 | 9.6 | 4800 | 3.5955 | 0.9983 |
|
102 |
+
| 3.744 | 9.8 | 4900 | 3.5832 | 0.9983 |
|
103 |
+
| 3.7656 | 10.0 | 5000 | 3.5749 | 0.9983 |
|
104 |
+
| 3.7119 | 10.2 | 5100 | 3.5686 | 0.9979 |
|
105 |
+
| 3.7246 | 10.4 | 5200 | 3.5613 | 0.9979 |
|
106 |
+
| 3.6999 | 10.6 | 5300 | 3.5483 | 0.9979 |
|
107 |
+
| 3.6942 | 10.8 | 5400 | 3.5395 | 0.9979 |
|
108 |
+
| 3.7076 | 11.0 | 5500 | 3.5338 | 0.9979 |
|
109 |
+
| 3.6577 | 11.2 | 5600 | 3.5219 | 0.9973 |
|
110 |
+
| 3.6771 | 11.4 | 5700 | 3.5147 | 0.9968 |
|
111 |
+
| 3.6948 | 11.6 | 5800 | 3.5055 | 0.9966 |
|
112 |
+
| 3.6699 | 11.8 | 5900 | 3.4969 | 0.9970 |
|
113 |
+
| 3.6306 | 12.0 | 6000 | 3.4868 | 0.9968 |
|
114 |
+
| 3.6393 | 12.2 | 6100 | 3.4800 | 0.9966 |
|
115 |
+
| 3.6745 | 12.4 | 6200 | 3.4712 | 0.9964 |
|
116 |
+
| 3.6641 | 12.6 | 6300 | 3.4659 | 0.9962 |
|
117 |
+
| 3.6167 | 12.8 | 6400 | 3.4595 | 0.9962 |
|
118 |
+
| 3.5831 | 13.0 | 6500 | 3.4537 | 0.9968 |
|
119 |
+
| 3.614 | 13.2 | 6600 | 3.4478 | 0.9966 |
|
120 |
+
| 3.6363 | 13.4 | 6700 | 3.4400 | 0.9966 |
|
121 |
+
| 3.6048 | 13.6 | 6800 | 3.4337 | 0.9966 |
|
122 |
+
| 3.5488 | 13.8 | 6900 | 3.4302 | 0.9966 |
|
123 |
+
| 3.5874 | 14.0 | 7000 | 3.4221 | 0.9964 |
|
124 |
+
| 3.5673 | 14.2 | 7100 | 3.4161 | 0.9962 |
|
125 |
+
| 3.5918 | 14.4 | 7200 | 3.4074 | 0.9964 |
|
126 |
+
| 3.6221 | 14.6 | 7300 | 3.4017 | 0.9964 |
|
127 |
+
| 3.516 | 14.8 | 7400 | 3.3931 | 0.9962 |
|
128 |
+
| 3.5529 | 15.0 | 7500 | 3.3872 | 0.9960 |
|
129 |
+
| 3.5173 | 15.2 | 7600 | 3.3806 | 0.9962 |
|
130 |
+
| 3.5608 | 15.4 | 7700 | 3.3721 | 0.9962 |
|
131 |
+
| 3.6101 | 15.6 | 7800 | 3.3639 | 0.9960 |
|
132 |
+
| 3.546 | 15.8 | 7900 | 3.3600 | 0.9958 |
|
133 |
+
| 3.481 | 16.0 | 8000 | 3.3549 | 0.9956 |
|
134 |
+
| 3.5324 | 16.2 | 8100 | 3.3471 | 0.9958 |
|
135 |
+
| 3.48 | 16.4 | 8200 | 3.3426 | 0.9962 |
|
136 |
+
| 3.5563 | 16.6 | 8300 | 3.3362 | 0.9956 |
|
137 |
+
| 3.5228 | 16.8 | 8400 | 3.3274 | 0.9949 |
|
138 |
+
| 3.4865 | 17.0 | 8500 | 3.3205 | 0.9954 |
|
139 |
+
| 3.5468 | 17.2 | 8600 | 3.3145 | 0.9962 |
|
140 |
+
| 3.4576 | 17.4 | 8700 | 3.3095 | 0.9962 |
|
141 |
+
| 3.4646 | 17.6 | 8800 | 3.3037 | 0.9964 |
|
142 |
+
| 3.4869 | 17.8 | 8900 | 3.2983 | 0.9966 |
|
143 |
+
| 3.4992 | 18.0 | 9000 | 3.2931 | 0.9962 |
|
144 |
+
| 3.4827 | 18.2 | 9100 | 3.2877 | 0.9960 |
|
145 |
+
| 3.4456 | 18.4 | 9200 | 3.2828 | 0.9956 |
|
146 |
+
| 3.5042 | 18.6 | 9300 | 3.2753 | 0.9960 |
|
147 |
+
| 3.4347 | 18.8 | 9400 | 3.2682 | 0.9968 |
|
148 |
+
| 3.4161 | 19.0 | 9500 | 3.2632 | 0.9962 |
|
149 |
+
| 3.4209 | 19.2 | 9600 | 3.2591 | 0.9958 |
|
150 |
+
| 3.4458 | 19.4 | 9700 | 3.2498 | 0.9962 |
|
151 |
+
| 3.4085 | 19.6 | 9800 | 3.2460 | 0.9956 |
|
152 |
+
| 3.4897 | 19.8 | 9900 | 3.2420 | 0.9958 |
|
153 |
+
| 3.4025 | 20.0 | 10000 | 3.2348 | 0.9960 |
|
154 |
+
| 3.4297 | 20.2 | 10100 | 3.2282 | 0.9962 |
|
155 |
+
| 3.4365 | 20.4 | 10200 | 3.2221 | 0.9968 |
|
156 |
+
| 3.4129 | 20.6 | 10300 | 3.2183 | 0.9962 |
|
157 |
+
| 3.4254 | 20.8 | 10400 | 3.2141 | 0.9960 |
|
158 |
+
| 3.3604 | 21.0 | 10500 | 3.2090 | 0.9958 |
|
159 |
+
| 3.3915 | 21.2 | 10600 | 3.2024 | 0.9956 |
|
160 |
+
| 3.4077 | 21.4 | 10700 | 3.1985 | 0.9958 |
|
161 |
+
| 3.3831 | 21.6 | 10800 | 3.1936 | 0.9962 |
|
162 |
+
| 3.414 | 21.8 | 10900 | 3.1885 | 0.9960 |
|
163 |
+
| 3.3778 | 22.0 | 11000 | 3.1834 | 0.9958 |
|
164 |
+
| 3.3987 | 22.2 | 11100 | 3.1798 | 0.9954 |
|
165 |
+
| 3.4096 | 22.4 | 11200 | 3.1766 | 0.9956 |
|
166 |
+
| 3.3784 | 22.6 | 11300 | 3.1724 | 0.9960 |
|
167 |
+
| 3.4194 | 22.8 | 11400 | 3.1680 | 0.9958 |
|
168 |
+
| 3.3011 | 23.0 | 11500 | 3.1658 | 0.9958 |
|
169 |
+
| 3.3206 | 23.2 | 11600 | 3.1631 | 0.9956 |
|
170 |
+
| 3.3476 | 23.4 | 11700 | 3.1577 | 0.9956 |
|
171 |
+
| 3.3604 | 23.6 | 11800 | 3.1540 | 0.9949 |
|
172 |
+
| 3.4032 | 23.8 | 11900 | 3.1475 | 0.9949 |
|
173 |
+
| 3.3523 | 24.0 | 12000 | 3.1421 | 0.9947 |
|
174 |
+
| 3.3223 | 24.2 | 12100 | 3.1386 | 0.9949 |
|
175 |
+
| 3.3869 | 24.4 | 12200 | 3.1342 | 0.9941 |
|
176 |
+
| 3.3354 | 24.6 | 12300 | 3.1295 | 0.9943 |
|
177 |
+
| 3.288 | 24.8 | 12400 | 3.1268 | 0.9941 |
|
178 |
+
| 3.3012 | 25.0 | 12500 | 3.1214 | 0.9939 |
|
179 |
+
| 3.3247 | 25.2 | 12600 | 3.1181 | 0.9939 |
|
180 |
+
| 3.3291 | 25.4 | 12700 | 3.1167 | 0.9935 |
|
181 |
+
| 3.3392 | 25.6 | 12800 | 3.1127 | 0.9939 |
|
182 |
+
| 3.281 | 25.8 | 12900 | 3.1089 | 0.9943 |
|
183 |
+
| 3.3083 | 26.0 | 13000 | 3.1030 | 0.9951 |
|
184 |
+
| 3.3973 | 26.2 | 13100 | 3.0982 | 0.9945 |
|
185 |
+
| 3.2582 | 26.4 | 13200 | 3.0948 | 0.9956 |
|
186 |
+
| 3.2509 | 26.6 | 13300 | 3.0916 | 0.9947 |
|
187 |
+
| 3.3027 | 26.8 | 13400 | 3.0875 | 0.9954 |
|
188 |
+
| 3.295 | 27.0 | 13500 | 3.0833 | 0.9937 |
|
189 |
+
| 3.2916 | 27.2 | 13600 | 3.0805 | 0.9943 |
|
190 |
+
| 3.2945 | 27.4 | 13700 | 3.0774 | 0.9937 |
|
191 |
+
| 3.2584 | 27.6 | 13800 | 3.0747 | 0.9939 |
|
192 |
+
| 3.3343 | 27.8 | 13900 | 3.0699 | 0.9945 |
|
193 |
+
| 3.24 | 28.0 | 14000 | 3.0661 | 0.9949 |
|
194 |
+
| 3.2768 | 28.2 | 14100 | 3.0614 | 0.9941 |
|
195 |
+
| 3.2713 | 28.4 | 14200 | 3.0587 | 0.9935 |
|
196 |
+
| 3.1811 | 28.6 | 14300 | 3.0544 | 0.9935 |
|
197 |
+
| 3.3279 | 28.8 | 14400 | 3.0506 | 0.9945 |
|
198 |
+
| 3.3166 | 29.0 | 14500 | 3.0470 | 0.9943 |
|
199 |
+
| 3.2904 | 29.2 | 14600 | 3.0454 | 0.9945 |
|
200 |
+
| 3.1675 | 29.4 | 14700 | 3.0395 | 0.9941 |
|
201 |
+
| 3.2665 | 29.6 | 14800 | 3.0368 | 0.9939 |
|
202 |
+
| 3.2087 | 29.8 | 14900 | 3.0320 | 0.9943 |
|
203 |
+
| 3.3436 | 30.0 | 15000 | 3.0290 | 0.9945 |
|
204 |
+
| 3.2558 | 30.2 | 15100 | 3.0267 | 0.9941 |
|
205 |
+
| 3.2631 | 30.4 | 15200 | 3.0222 | 0.9941 |
|
206 |
+
| 3.3143 | 30.6 | 15300 | 3.0184 | 0.9941 |
|
207 |
+
| 3.1722 | 30.8 | 15400 | 3.0135 | 0.9943 |
|
208 |
+
| 3.1736 | 31.0 | 15500 | 3.0101 | 0.9937 |
|
209 |
+
| 3.2694 | 31.2 | 15600 | 3.0052 | 0.9941 |
|
210 |
+
| 3.2143 | 31.4 | 15700 | 3.0015 | 0.9937 |
|
211 |
+
| 3.2431 | 31.6 | 15800 | 2.9993 | 0.9939 |
|
212 |
+
| 3.194 | 31.8 | 15900 | 2.9961 | 0.9937 |
|
213 |
+
| 3.1784 | 32.0 | 16000 | 2.9906 | 0.9937 |
|
214 |
+
| 3.239 | 32.2 | 16100 | 2.9866 | 0.9930 |
|
215 |
+
| 3.1766 | 32.4 | 16200 | 2.9837 | 0.9945 |
|
216 |
+
| 3.2049 | 32.6 | 16300 | 2.9788 | 0.9945 |
|
217 |
+
| 3.2638 | 32.8 | 16400 | 2.9769 | 0.9943 |
|
218 |
+
| 3.1008 | 33.0 | 16500 | 2.9749 | 0.9941 |
|
219 |
+
| 3.1918 | 33.2 | 16600 | 2.9728 | 0.9947 |
|
220 |
+
| 3.2645 | 33.4 | 16700 | 2.9702 | 0.9949 |
|
221 |
+
| 3.1329 | 33.6 | 16800 | 2.9615 | 0.9949 |
|
222 |
+
| 3.2031 | 33.8 | 16900 | 2.9575 | 0.9947 |
|
223 |
+
| 3.1297 | 34.0 | 17000 | 2.9542 | 0.9947 |
|
224 |
+
| 3.115 | 34.2 | 17100 | 2.9521 | 0.9947 |
|
225 |
+
| 3.1786 | 34.4 | 17200 | 2.9503 | 0.9947 |
|
226 |
+
| 3.1434 | 34.6 | 17300 | 2.9452 | 0.9949 |
|
227 |
+
| 3.2159 | 34.8 | 17400 | 2.9415 | 0.9943 |
|
228 |
+
| 3.1425 | 35.0 | 17500 | 2.9366 | 0.9943 |
|
229 |
+
| 3.1596 | 35.2 | 17600 | 2.9328 | 0.9943 |
|
230 |
+
| 3.1411 | 35.4 | 17700 | 2.9308 | 0.9935 |
|
231 |
+
| 3.2655 | 35.6 | 17800 | 2.9263 | 0.9941 |
|
232 |
+
| 3.1058 | 35.8 | 17900 | 2.9235 | 0.9928 |
|
233 |
+
| 3.1415 | 36.0 | 18000 | 2.9210 | 0.9930 |
|
234 |
+
| 3.1031 | 36.2 | 18100 | 2.9178 | 0.9935 |
|
235 |
+
| 3.1074 | 36.4 | 18200 | 2.9148 | 0.9939 |
|
236 |
+
| 3.0887 | 36.6 | 18300 | 2.9107 | 0.9937 |
|
237 |
+
| 3.2359 | 36.8 | 18400 | 2.9078 | 0.9932 |
|
238 |
+
| 3.137 | 37.0 | 18500 | 2.9060 | 0.9935 |
|
239 |
+
| 3.1064 | 37.2 | 18600 | 2.9044 | 0.9935 |
|
240 |
+
| 3.0584 | 37.4 | 18700 | 2.9010 | 0.9947 |
|
241 |
+
| 3.1004 | 37.6 | 18800 | 2.8977 | 0.9943 |
|
242 |
+
| 3.1034 | 37.8 | 18900 | 2.8948 | 0.9945 |
|
243 |
+
| 3.2163 | 38.0 | 19000 | 2.8906 | 0.9945 |
|
244 |
+
| 3.0611 | 38.2 | 19100 | 2.8864 | 0.9949 |
|
245 |
+
| 3.0713 | 38.4 | 19200 | 2.8852 | 0.9947 |
|
246 |
+
| 3.1233 | 38.6 | 19300 | 2.8816 | 0.9947 |
|
247 |
+
| 3.1374 | 38.8 | 19400 | 2.8776 | 0.9943 |
|
248 |
+
| 3.157 | 39.0 | 19500 | 2.8758 | 0.9937 |
|
249 |
+
| 3.1202 | 39.2 | 19600 | 2.8747 | 0.9935 |
|
250 |
+
| 3.0945 | 39.4 | 19700 | 2.8713 | 0.9941 |
|
251 |
+
| 3.0415 | 39.6 | 19800 | 2.8680 | 0.9947 |
|
252 |
+
| 3.0462 | 39.8 | 19900 | 2.8626 | 0.9941 |
|
253 |
+
| 3.1603 | 40.0 | 20000 | 2.8618 | 0.9943 |
|
254 |
+
| 3.0741 | 40.2 | 20100 | 2.8571 | 0.9945 |
|
255 |
+
| 3.0228 | 40.4 | 20200 | 2.8556 | 0.9949 |
|
256 |
+
| 3.1765 | 40.6 | 20300 | 2.8511 | 0.9943 |
|
257 |
+
| 3.027 | 40.8 | 20400 | 2.8478 | 0.9949 |
|
258 |
+
| 3.0472 | 41.0 | 20500 | 2.8451 | 0.9949 |
|
259 |
+
| 3.0993 | 41.2 | 20600 | 2.8446 | 0.9937 |
|
260 |
+
| 3.0562 | 41.4 | 20700 | 2.8414 | 0.9943 |
|
261 |
+
| 3.1409 | 41.6 | 20800 | 2.8383 | 0.9945 |
|
262 |
+
| 3.004 | 41.8 | 20900 | 2.8355 | 0.9943 |
|
263 |
+
| 3.0377 | 42.0 | 21000 | 2.8352 | 0.9945 |
|
264 |
+
| 3.1136 | 42.2 | 21100 | 2.8304 | 0.9949 |
|
265 |
+
| 3.0709 | 42.4 | 21200 | 2.8272 | 0.9947 |
|
266 |
+
| 3.0435 | 42.6 | 21300 | 2.8227 | 0.9947 |
|
267 |
+
| 3.0247 | 42.8 | 21400 | 2.8226 | 0.9943 |
|
268 |
+
| 3.0393 | 43.0 | 21500 | 2.8220 | 0.9949 |
|
269 |
+
| 3.037 | 43.2 | 21600 | 2.8182 | 0.9954 |
|
270 |
+
| 3.0403 | 43.4 | 21700 | 2.8149 | 0.9951 |
|
271 |
+
| 3.1406 | 43.6 | 21800 | 2.8141 | 0.9947 |
|
272 |
+
| 2.9519 | 43.8 | 21900 | 2.8129 | 0.9943 |
|
273 |
+
| 2.9742 | 44.0 | 22000 | 2.8075 | 0.9949 |
|
274 |
+
| 3.0384 | 44.2 | 22100 | 2.8039 | 0.9947 |
|
275 |
+
| 3.0387 | 44.4 | 22200 | 2.8021 | 0.9951 |
|
276 |
+
| 3.0851 | 44.6 | 22300 | 2.8025 | 0.9945 |
|
277 |
+
| 3.0079 | 44.8 | 22400 | 2.7977 | 0.9945 |
|
278 |
+
| 2.9731 | 45.0 | 22500 | 2.7951 | 0.9937 |
|
279 |
+
| 2.9938 | 45.2 | 22600 | 2.7972 | 0.9939 |
|
280 |
+
| 2.9564 | 45.4 | 22700 | 2.7926 | 0.9939 |
|
281 |
+
|
282 |
+
|
283 |
### Framework versions
|
284 |
|
285 |
- Transformers 4.43.4
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 3859172744
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:54871f5788d56c60445730a0e9f1ba50a4b37c92bb4f1bc067baa4eca6c1a1c7
|
3 |
size 3859172744
|