stulcrad/fine_tuned_XLMROBERTA_cs_wikann

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1699
-- Precision: 0.9133
-- Recall: 0.9319
-- F1: 0.9225
-- Accuracy: 0.9699
 ## Model description
@@ -44,52 +44,35 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 7
 ### Training results
-| Training Loss | Epoch | Step  | Validation Loss | Precision | Recall | F1     | Accuracy |
-|:-------------:|:-----:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 0.7699        | 0.2   | 500   | 0.3588          | 0.5878    | 0.6990 | 0.6386 | 0.8894   |
-| 0.3658        | 0.4   | 1000  | 0.2538          | 0.7427    | 0.8258 | 0.7821 | 0.9355   |
-| 0.301         | 0.6   | 1500  | 0.2403          | 0.7649    | 0.8237 | 0.7932 | 0.9400   |
-| 0.2796        | 0.8   | 2000  | 0.1828          | 0.7967    | 0.8509 | 0.8229 | 0.9456   |
-| 0.258         | 1.0   | 2500  | 0.2223          | 0.7770    | 0.8322 | 0.8037 | 0.9400   |
-| 0.2192        | 1.2   | 3000  | 0.1911          | 0.8156    | 0.8745 | 0.8440 | 0.9511   |
-| 0.2161        | 1.4   | 3500  | 0.1878          | 0.8401    | 0.8858 | 0.8623 | 0.9551   |
-| 0.2095        | 1.6   | 4000  | 0.1916          | 0.8306    | 0.8783 | 0.8538 | 0.9559   |
-| 0.2137        | 1.8   | 4500  | 0.1657          | 0.8573    | 0.8874 | 0.8721 | 0.9585   |
-| 0.1884        | 2.0   | 5000  | 0.2134          | 0.8486    | 0.8837 | 0.8658 | 0.9542   |
-| 0.164         | 2.2   | 5500  | 0.2038          | 0.8619    | 0.9048 | 0.8828 | 0.9588   |
-| 0.1564        | 2.4   | 6000  | 0.1707          | 0.8502    | 0.8874 | 0.8684 | 0.9582   |
-| 0.1719        | 2.6   | 6500  | 0.1781          | 0.8645    | 0.8994 | 0.8816 | 0.9610   |
-| 0.1565        | 2.8   | 7000  | 0.1908          | 0.8712    | 0.9021 | 0.8864 | 0.9614   |
-| 0.1713        | 3.0   | 7500  | 0.1628          | 0.8672    | 0.8954 | 0.8811 | 0.9623   |
-| 0.1359        | 3.2   | 8000  | 0.1890          | 0.8684    | 0.9072 | 0.8874 | 0.9624   |
-| 0.1362        | 3.4   | 8500  | 0.1672          | 0.8653    | 0.9065 | 0.8854 | 0.9620   |
-| 0.1301        | 3.6   | 9000  | 0.1866          | 0.8698    | 0.9069 | 0.8879 | 0.9631   |
-| 0.1345        | 3.8   | 9500  | 0.1766          | 0.8759    | 0.9071 | 0.8913 | 0.9647   |
-| 0.1363        | 4.0   | 10000 | 0.1817          | 0.8700    | 0.9137 | 0.8913 | 0.9626   |
-| 0.1097        | 4.2   | 10500 | 0.1611          | 0.8861    | 0.9118 | 0.8987 | 0.9653   |
-| 0.1045        | 4.4   | 11000 | 0.1743          | 0.8899    | 0.9123 | 0.9009 | 0.9659   |
-| 0.1068        | 4.6   | 11500 | 0.1771          | 0.8870    | 0.9167 | 0.9016 | 0.9660   |
-| 0.1168        | 4.8   | 12000 | 0.1704          | 0.8894    | 0.9174 | 0.9032 | 0.9660   |
-| 0.1116        | 5.0   | 12500 | 0.1748          | 0.8926    | 0.9203 | 0.9062 | 0.9673   |
-| 0.0979        | 5.2   | 13000 | 0.1726          | 0.8956    | 0.9255 | 0.9103 | 0.9672   |
-| 0.0992        | 5.4   | 13500 | 0.1798          | 0.9058    | 0.9280 | 0.9168 | 0.9686   |
-| 0.0929        | 5.6   | 14000 | 0.1740          | 0.9063    | 0.9304 | 0.9182 | 0.9693   |
-| 0.098         | 5.8   | 14500 | 0.1690          | 0.8931    | 0.9262 | 0.9094 | 0.9683   |
-| 0.0878        | 6.0   | 15000 | 0.1682          | 0.9065    | 0.9294 | 0.9178 | 0.9696   |
-| 0.0925        | 6.2   | 15500 | 0.1691          | 0.9102    | 0.9308 | 0.9204 | 0.9694   |
-| 0.0841        | 6.4   | 16000 | 0.1657          | 0.9138    | 0.9298 | 0.9217 | 0.9699   |
-| 0.0748        | 6.6   | 16500 | 0.1696          | 0.9114    | 0.9313 | 0.9213 | 0.9695   |
-| 0.0753        | 6.8   | 17000 | 0.1703          | 0.9118    | 0.9311 | 0.9214 | 0.9697   |
-| 0.073         | 7.0   | 17500 | 0.1699          | 0.9133    | 0.9319 | 0.9225 | 0.9699   |
 ### Framework versions

 This model is a fine-tuned version of [FacebookAI/xlm-roberta-large](https://huggingface.co/FacebookAI/xlm-roberta-large) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1543
+- Precision: 0.9203
+- Recall: 0.9342
+- F1: 0.9272
+- Accuracy: 0.9732
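
As a quick sanity check on the headline numbers, the reported F1 can be recomputed from the reported precision and recall. This is a minimal sketch, assuming F1 here is the harmonic mean of the overall precision and recall; the helper name `f1_score` and the dictionary labels are illustrative and not part of the model card.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (0.0 if both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) copied from the two sides of this diff.
reported = {
    "previous": (0.9133, 0.9319, 0.9225),
    "updated": (0.9203, 0.9342, 0.9272),
}

for name, (precision, recall, f1) in reported.items():
    recomputed = round(f1_score(precision, recall), 4)
    print(f"{name}: recomputed F1 = {recomputed}, reported F1 = {f1}")
```

Both recomputed values agree with the card to four decimal places, so the updated figures are internally consistent.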
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
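
The updated hyperparameters could be written out as keyword arguments for `transformers.TrainingArguments`. This is a sketch only: the mapping from the card's field names to argument names (for example `train_batch_size` to `per_device_train_batch_size`, which assumes a single device) is an assumption, and `output_dir` is a hypothetical path. Only the fields visible in this diff are included.

```python
# Keyword names follow transformers.TrainingArguments; the mapping from the
# model card's field names is an assumption, not taken from the training script.
training_kwargs = {
    "output_dir": "xlmr-large-cs-ner",  # hypothetical path, not from the card
    "learning_rate": 2e-5,
    "per_device_train_batch_size": 16,  # card: train_batch_size (was 8)
    "per_device_eval_batch_size": 16,   # card: eval_batch_size
    "seed": 42,
    "adam_beta1": 0.9,
    "adam_beta2": 0.999,
    "adam_epsilon": 1e-8,
    "lr_scheduler_type": "linear",
    "num_train_epochs": 5,              # card: num_epochs (was 7)
}

# With transformers installed, these could be passed on as:
#   from transformers import TrainingArguments
#   args = TrainingArguments(**training_kwargs)
```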
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
+|:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 0.51          | 0.27  | 500  | 0.1995          | 0.7873    | 0.8274 | 0.8069 | 0.9435   |
+| 0.2164        | 0.53  | 1000 | 0.2216          | 0.7743    | 0.8430 | 0.8072 | 0.9407   |
+| 0.1963        | 0.8   | 1500 | 0.1673          | 0.8465    | 0.8849 | 0.8653 | 0.9534   |
+| 0.1478        | 1.07  | 2000 | 0.1612          | 0.8850    | 0.9    | 0.8925 | 0.9629   |
+| 0.1316        | 1.33  | 2500 | 0.1508          | 0.8765    | 0.9081 | 0.8920 | 0.9615   |
+| 0.1156        | 1.6   | 3000 | 0.1561          | 0.9028    | 0.9081 | 0.9054 | 0.9656   |
+| 0.1069        | 1.87  | 3500 | 0.1544          | 0.9009    | 0.9091 | 0.9050 | 0.9651   |
+| 0.0925        | 2.13  | 4000 | 0.1724          | 0.9008    | 0.9216 | 0.9111 | 0.9662   |
+| 0.0791        | 2.4   | 4500 | 0.1385          | 0.9096    | 0.9201 | 0.9148 | 0.9705   |
+| 0.0739        | 2.67  | 5000 | 0.1309          | 0.9130    | 0.9254 | 0.9192 | 0.9701   |
+| 0.0732        | 2.93  | 5500 | 0.1593          | 0.9035    | 0.9190 | 0.9112 | 0.9679   |
+| 0.0538        | 3.2   | 6000 | 0.1550          | 0.9193    | 0.9309 | 0.9251 | 0.9722   |
+| 0.0529        | 3.47  | 6500 | 0.1451          | 0.9112    | 0.9330 | 0.9220 | 0.9710   |
+| 0.0521        | 3.73  | 7000 | 0.1510          | 0.9185    | 0.9323 | 0.9253 | 0.9721   |
+| 0.0526        | 4.0   | 7500 | 0.1378          | 0.9173    | 0.9325 | 0.9249 | 0.9727   |
+| 0.0377        | 4.27  | 8000 | 0.1501          | 0.9164    | 0.9344 | 0.9253 | 0.9728   |
+| 0.0382        | 4.53  | 8500 | 0.1541          | 0.9213    | 0.9352 | 0.9282 | 0.9729   |
+| 0.0358        | 4.8   | 9000 | 0.1543          | 0.9203    | 0.9342 | 0.9272 | 0.9732   |
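
To see where the updated run peaked, the table above can be scanned for the lowest validation loss and the highest F1. This is a minimal sketch with the (step, validation loss, F1) triples copied from the new table; the variable names are illustrative.

```python
# (step, validation_loss, f1) copied from the updated training results table.
rows = [
    (500, 0.1995, 0.8069), (1000, 0.2216, 0.8072), (1500, 0.1673, 0.8653),
    (2000, 0.1612, 0.8925), (2500, 0.1508, 0.8920), (3000, 0.1561, 0.9054),
    (3500, 0.1544, 0.9050), (4000, 0.1724, 0.9111), (4500, 0.1385, 0.9148),
    (5000, 0.1309, 0.9192), (5500, 0.1593, 0.9112), (6000, 0.1550, 0.9251),
    (6500, 0.1451, 0.9220), (7000, 0.1510, 0.9253), (7500, 0.1378, 0.9249),
    (8000, 0.1501, 0.9253), (8500, 0.1541, 0.9282), (9000, 0.1543, 0.9272),
]

lowest_loss = min(rows, key=lambda row: row[1])  # smallest validation loss
highest_f1 = max(rows, key=lambda row: row[2])   # largest F1
final = rows[-1]                                 # last logged evaluation

print("lowest validation loss:", lowest_loss)
print("highest F1:", highest_f1)
print("final row:", final)
```

The headline metrics in the card match the final logged row (step 9000), while the lowest validation loss is logged at step 5000 and the highest F1 at step 8500.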
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4e251f20e9443b5cd4f815299a986175d93d8897d54edf2cee3960ccb4db6629
 size 2235440556

 version https://git-lfs.github.com/spec/v1
+oid sha256:4488d827727ef1b0088e090b30482f601000e7bc43783e180eedc0ef21db1ec9
 size 2235440556

runs/Feb21_19-04-07_n21/events.out.tfevents.1708538654.n21.1436555.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:9b312e9e4490070e6053005d7db3ff980d9bb086eadda3c304abb8045d86f6fb
-size 16000

 version https://git-lfs.github.com/spec/v1
+oid sha256:84299ba6ff09150d2bc829f0f15eb34b933a963e106cfcac59c3df10430b599e
+size 16354