muhammadravi251001/fine-tuned-DatasetQAS-TYDI-QA-ID-with-indobert-base-uncased-with-ITTL-without-freeze-LR-1e-05

@@ -16,9 +16,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [indolem/indobert-base-uncased](https://huggingface.co/indolem/indobert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.2740
-- Exact Match: 56.0847
-- F1: 70.6246
 ## Model description
@@ -38,10 +38,10 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
-- gradient_accumulation_steps: 16
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -51,31 +51,31 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Exact Match | F1      |
 |:-------------:|:-----:|:----:|:---------------:|:-----------:|:-------:|
-| 6.306         | 0.5   | 19   | 3.7982          | 6.5256      | 20.3655 |
-| 6.306         | 1.0   | 38   | 2.8932          | 14.1093     | 26.0679 |
-| 3.9254        | 1.5   | 57   | 2.4798          | 19.4004     | 32.1438 |
-| 3.9254        | 2.0   | 76   | 2.2955          | 26.1023     | 37.6331 |
-| 3.9254        | 2.5   | 95   | 2.1688          | 26.9841     | 39.2632 |
-| 2.4328        | 3.0   | 114  | 2.0701          | 30.1587     | 41.3438 |
-| 2.4328        | 3.5   | 133  | 1.9789          | 33.1570     | 45.0539 |
-| 2.1127        | 4.0   | 152  | 1.8465          | 37.2134     | 48.9042 |
-| 2.1127        | 4.5   | 171  | 1.7699          | 38.9771     | 50.9760 |
-| 2.1127        | 5.0   | 190  | 1.6885          | 41.0935     | 54.1550 |
-| 1.7875        | 5.5   | 209  | 1.5785          | 45.1499     | 58.6783 |
-| 1.7875        | 6.0   | 228  | 1.4954          | 49.2063     | 62.7869 |
-| 1.7875        | 6.5   | 247  | 1.4186          | 51.8519     | 65.7461 |
-| 1.5267        | 7.0   | 266  | 1.3734          | 53.4392     | 67.6141 |
-| 1.5267        | 7.5   | 285  | 1.3419          | 54.1446     | 68.2563 |
-| 1.3317        | 8.0   | 304  | 1.3116          | 55.5556     | 69.1996 |
-| 1.3317        | 8.5   | 323  | 1.2936          | 56.0847     | 69.9806 |
-| 1.3317        | 9.0   | 342  | 1.2900          | 56.2610     | 70.1634 |
-| 1.2556        | 9.5   | 361  | 1.2771          | 55.7319     | 70.1143 |
-| 1.2556        | 10.0  | 380  | 1.2740          | 56.0847     | 70.6246 |
 ### Framework versions
-- Transformers 4.26.1
 - Pytorch 1.13.1+cu117
 - Datasets 2.2.0
 - Tokenizers 0.13.2

 This model is a fine-tuned version of [indolem/indobert-base-uncased](https://huggingface.co/indolem/indobert-base-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.2784
+- Exact Match: 53.4392
+- F1: 68.7244
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
+- train_batch_size: 4
+- eval_batch_size: 4
 - seed: 42
+- gradient_accumulation_steps: 32
 - total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Exact Match | F1      |
 |:-------------:|:-----:|:----:|:---------------:|:-----------:|:-------:|
+| 6.1764        | 0.5   | 19   | 3.7674          | 10.4056     | 23.6332 |
+| 6.1764        | 1.0   | 38   | 2.7985          | 19.5767     | 32.6228 |
+| 3.8085        | 1.49  | 57   | 2.4169          | 22.0459     | 35.4084 |
+| 3.8085        | 1.99  | 76   | 2.2811          | 25.9259     | 38.3963 |
+| 3.8085        | 2.49  | 95   | 2.1607          | 28.0423     | 40.3901 |
+| 2.3932        | 2.99  | 114  | 2.0488          | 31.0406     | 43.7059 |
+| 2.3932        | 3.49  | 133  | 1.9787          | 34.3915     | 46.3655 |
+| 2.0772        | 3.98  | 152  | 1.8661          | 37.2134     | 49.1483 |
+| 2.0772        | 4.48  | 171  | 1.7893          | 40.2116     | 52.4989 |
+| 2.0772        | 4.98  | 190  | 1.7014          | 41.9753     | 54.9197 |
+| 1.7645        | 5.48  | 209  | 1.5940          | 44.2681     | 58.2134 |
+| 1.7645        | 5.98  | 228  | 1.4972          | 46.2081     | 60.4997 |
+| 1.7645        | 6.47  | 247  | 1.4214          | 48.8536     | 63.4371 |
+| 1.5035        | 6.97  | 266  | 1.3676          | 50.6173     | 65.4663 |
+| 1.5035        | 7.47  | 285  | 1.3357          | 52.2046     | 67.1759 |
+| 1.3206        | 7.97  | 304  | 1.3149          | 53.0864     | 68.0698 |
+| 1.3206        | 8.47  | 323  | 1.2988          | 53.4392     | 68.3971 |
+| 1.3206        | 8.96  | 342  | 1.2894          | 53.6155     | 68.8897 |
+| 1.2472        | 9.46  | 361  | 1.2820          | 53.4392     | 68.5835 |
+| 1.2472        | 9.96  | 380  | 1.2784          | 53.4392     | 68.7244 |
 ### Framework versions
+- Transformers 4.27.4
 - Pytorch 1.13.1+cu117
 - Datasets 2.2.0
 - Tokenizers 0.13.2
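
The change halves `train_batch_size` (8 to 4) and doubles `gradient_accumulation_steps` (16 to 32), while `total_train_batch_size` stays at 128. A minimal sketch of that arithmetic follows. The helper name `effective_batch_size` and the `num_devices=1` default are assumptions made for illustration; the card does not state the device count, although 8 × 16 = 128 is consistent with a single device.

```python
def effective_batch_size(per_device_batch_size: int,
                         gradient_accumulation_steps: int,
                         num_devices: int = 1) -> int:
    """Number of examples contributing to one optimizer step.

    Hypothetical helper, not part of any library. num_devices=1 is an
    assumption; the model card does not report the device count.
    """
    return per_device_batch_size * gradient_accumulation_steps * num_devices


# Values taken from the removed (-) and added (+) hyperparameter lines.
before = effective_batch_size(per_device_batch_size=8, gradient_accumulation_steps=16)
after = effective_batch_size(per_device_batch_size=4, gradient_accumulation_steps=32)

print(before, after)
```

Both configurations give the same number of examples per optimizer step, which matches the unchanged `total_train_batch_size: 128` line and the identical step counts (19 to 380) in both results tables.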