End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [muchad/idt5-base](https://huggingface.co/muchad/idt5-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.1193
-- Rouge1: 0.3286
-- Rouge2: 0.1753
-- Rougel: 0.3016
-- Rougelsum: 0.3039
-- Bleu: 0.1368
 ## Model description
@@ -53,13 +53,13 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu   |
-|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|
-| 1.4796        | 1.0   | 1695 | 1.2868          | 0.2748 | 0.1159 | 0.2489 | 0.2521    | 0.1543 |
-| 1.2884        | 2.0   | 3390 | 1.1966          | 0.3005 | 0.1461 | 0.2733 | 0.2760    | 0.1235 |
-| 1.1838        | 3.0   | 5085 | 1.1449          | 0.3188 | 0.1644 | 0.2914 | 0.2938    | 0.1319 |
-| 1.152         | 4.0   | 6780 | 1.1288          | 0.3266 | 0.1738 | 0.2997 | 0.3018    | 0.1364 |
-| 1.1397        | 5.0   | 8475 | 1.1193          | 0.3286 | 0.1753 | 0.3016 | 0.3039    | 0.1368 |
 ### Framework versions

 This model is a fine-tuned version of [muchad/idt5-base](https://huggingface.co/muchad/idt5-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.6189
+- Rouge1: 0.2865
+- Rouge2: 0.1723
+- Rougel: 0.2835
+- Rougelsum: 0.2833
+- Bleu: 0.1327
 ## Model description
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu   |
+|:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|
+| 2.3292        | 1.0   | 3235  | 1.8863          | 0.2475 | 0.1449 | 0.2451 | 0.2454    | 0.1103 |
+| 2.0895        | 2.0   | 6470  | 1.6991          | 0.2761 | 0.1678 | 0.2725 | 0.2727    | 0.1388 |
+| 1.9092        | 3.0   | 9705  | 1.6346          | 0.2798 | 0.1671 | 0.2773 | 0.2775    | 0.1278 |
+| 1.9178        | 4.0   | 12940 | 1.6246          | 0.2839 | 0.1705 | 0.2813 | 0.2809    | 0.1227 |
+| 1.9038        | 5.0   | 16175 | 1.6189          | 0.2865 | 0.1723 | 0.2835 | 0.2833    | 0.1327 |
 ### Framework versions

adapter_config.json CHANGED Viewed

@@ -16,7 +16,7 @@
   "megatron_core": "megatron.core",
   "modules_to_save": null,
   "peft_type": "LORA",
-  "r": 64,
   "rank_pattern": {},
   "revision": null,
   "target_modules": [

   "megatron_core": "megatron.core",
   "modules_to_save": null,
   "peft_type": "LORA",
+  "r": 8,
   "rank_pattern": {},
   "revision": null,
   "target_modules": [

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7063f1ec31870bcb8b24d75b30004cf3abf9d089a5815a99b7e49303148f3a5e
-size 28331904

 version https://git-lfs.github.com/spec/v1
+oid sha256:eeb4a507f4ef5b11c4d1f66f82bca2eb0f8fc76298501798c6af1d74ec0531a5
+size 3558888

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1f2863247f812421aaa7aba71749c7c4878ae4360a8b215308975ad0530255eb
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:8bbfe930a8403ae00efaf5a5b663deb6a06587d6058a1fa9f4fec2f1d365433d
 size 5368