hawalurahman committed on
Commit 482c63b · verified · 1 Parent(s): 708e5bc

End of training

README.md CHANGED
@@ -19,12 +19,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [muchad/idt5-base](https://huggingface.co/muchad/idt5-base) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.1193
- - Rouge1: 0.3286
- - Rouge2: 0.1753
- - Rougel: 0.3016
- - Rougelsum: 0.3039
- - Bleu: 0.1368
+ - Loss: 1.6189
+ - Rouge1: 0.2865
+ - Rouge2: 0.1723
+ - Rougel: 0.2835
+ - Rougelsum: 0.2833
+ - Bleu: 0.1327
 
  ## Model description
 
@@ -53,13 +53,13 @@ The following hyperparameters were used during training:
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu |
- |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|
- | 1.4796 | 1.0 | 1695 | 1.2868 | 0.2748 | 0.1159 | 0.2489 | 0.2521 | 0.1543 |
- | 1.2884 | 2.0 | 3390 | 1.1966 | 0.3005 | 0.1461 | 0.2733 | 0.2760 | 0.1235 |
- | 1.1838 | 3.0 | 5085 | 1.1449 | 0.3188 | 0.1644 | 0.2914 | 0.2938 | 0.1319 |
- | 1.152 | 4.0 | 6780 | 1.1288 | 0.3266 | 0.1738 | 0.2997 | 0.3018 | 0.1364 |
- | 1.1397 | 5.0 | 8475 | 1.1193 | 0.3286 | 0.1753 | 0.3016 | 0.3039 | 0.1368 |
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu |
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|:------:|
+ | 2.3292 | 1.0 | 3235 | 1.8863 | 0.2475 | 0.1449 | 0.2451 | 0.2454 | 0.1103 |
+ | 2.0895 | 2.0 | 6470 | 1.6991 | 0.2761 | 0.1678 | 0.2725 | 0.2727 | 0.1388 |
+ | 1.9092 | 3.0 | 9705 | 1.6346 | 0.2798 | 0.1671 | 0.2773 | 0.2775 | 0.1278 |
+ | 1.9178 | 4.0 | 12940 | 1.6246 | 0.2839 | 0.1705 | 0.2813 | 0.2809 | 0.1227 |
+ | 1.9038 | 5.0 | 16175 | 1.6189 | 0.2865 | 0.1723 | 0.2835 | 0.2833 | 0.1327 |
 
 
  ### Framework versions
adapter_config.json CHANGED
@@ -16,7 +16,7 @@
  "megatron_core": "megatron.core",
  "modules_to_save": null,
  "peft_type": "LORA",
- "r": 64,
+ "r": 8,
  "rank_pattern": {},
  "revision": null,
  "target_modules": [
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:7063f1ec31870bcb8b24d75b30004cf3abf9d089a5815a99b7e49303148f3a5e
- size 28331904
+ oid sha256:eeb4a507f4ef5b11c4d1f66f82bca2eb0f8fc76298501798c6af1d74ec0531a5
+ size 3558888
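
Note on the adapter size change: the drop from 28331904 to 3558888 bytes tracks the LoRA rank change from 64 to 8, since each adapted linear layer stores two matrices whose parameter count grows linearly with `r`. The sketch below checks that arithmetic. The ranks and file sizes come from this commit; the hidden size, the number of adapted layers, and the float32 dtype are assumptions (the `target_modules` list is cut off in the diff), so treat the result as a plausibility check rather than a description of the actual adapter.

```python
# Ranks and LFS sizes (bytes) copied from the adapter_config.json and
# adapter_model.safetensors hunks of this commit.
OLD_RANK, OLD_SIZE = 64, 28_331_904
NEW_RANK, NEW_SIZE = 8, 3_558_888

# ASSUMPTIONS, not stated anywhere in the diff:
D_MODEL = 768        # hidden size of a T5-base-shaped model
N_ADAPTED = 72       # 36 attention blocks x 2 adapted projections (e.g. q and v)
BYTES_PER_PARAM = 4  # float32 weights


def lora_tensor_bytes(rank: int) -> int:
    """Bytes used by LoRA A (rank x d) and B (d x rank) over all adapted layers."""
    params_per_layer = rank * (D_MODEL + D_MODEL)
    return N_ADAPTED * params_per_layer * BYTES_PER_PARAM


def report():
    """Return (rank, file size, estimated tensor bytes, remainder) per version."""
    rows = []
    for rank, size in ((OLD_RANK, OLD_SIZE), (NEW_RANK, NEW_SIZE)):
        estimate = lora_tensor_bytes(rank)
        rows.append((rank, size, estimate, size - estimate))
    return rows


for rank, size, estimate, remainder in report():
    print(f"r={rank:>2}  file={size:>10,}  tensors~{estimate:>10,}  other={remainder:,}")

print(f"size ratio: {OLD_SIZE / NEW_SIZE:.2f}  rank ratio: {OLD_RANK / NEW_RANK:.1f}")
```

Under these assumptions the estimated tensor bytes fall roughly 20 KB short of the file size for both ranks, which would be consistent with rank-independent overhead such as the file header. The file-size ratio comes out slightly below the rank ratio of 8 for the same reason.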
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:1f2863247f812421aaa7aba71749c7c4878ae4360a8b215308975ad0530255eb
+ oid sha256:8bbfe930a8403ae00efaf5a5b663deb6a06587d6058a1fa9f4fec2f1d365433d
  size 5368