jbochi committed on
Commit 33bca89
1 Parent(s): abc53ac

End of training

README.md ADDED
@@ -0,0 +1,81 @@
+ ---
+ license: apache-2.0
+ base_model: google/flan-t5-large
+ tags:
+ - generated_from_trainer
+ metrics:
+ - rouge
+ model-index:
+ - name: flan-t5-large-spelling-peft
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # flan-t5-large-spelling-peft
+
+ This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.2537
+ - Rouge1: 95.8905
+ - Rouge2: 91.9178
+ - Rougel: 95.8459
+ - Rougelsum: 95.8393
+ - Gen Len: 33.61
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 0.001
+ - train_batch_size: 64
+ - eval_batch_size: 64
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 1
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+ | 0.3359 | 0.05 | 500 | 0.2738 | 95.8385 | 91.6723 | 95.7821 | 95.766 | 33.5 |
+ | 0.2853 | 0.11 | 1000 | 0.2702 | 95.7124 | 91.5043 | 95.656 | 95.651 | 33.53 |
+ | 0.2691 | 0.16 | 1500 | 0.2691 | 95.735 | 91.7108 | 95.7039 | 95.7067 | 33.41 |
+ | 0.2596 | 0.21 | 2000 | 0.2663 | 95.9819 | 92.0897 | 95.9519 | 95.9488 | 33.51 |
+ | 0.2536 | 0.27 | 2500 | 0.2621 | 95.7519 | 91.5445 | 95.6614 | 95.6622 | 33.49 |
+ | 0.2472 | 0.32 | 3000 | 0.2626 | 95.7052 | 91.7321 | 95.6476 | 95.6512 | 33.58 |
+ | 0.2448 | 0.37 | 3500 | 0.2669 | 95.8003 | 91.7949 | 95.7536 | 95.7576 | 33.57 |
+ | 0.2345 | 0.43 | 4000 | 0.2582 | 95.8784 | 92.008 | 95.8284 | 95.8343 | 33.65 |
+ | 0.2345 | 0.48 | 4500 | 0.2629 | 95.8131 | 91.9088 | 95.7624 | 95.766 | 33.63 |
+ | 0.2284 | 0.53 | 5000 | 0.2585 | 95.8552 | 91.9833 | 95.8105 | 95.8135 | 33.62 |
+ | 0.2266 | 0.59 | 5500 | 0.2591 | 95.9205 | 92.0577 | 95.8689 | 95.8718 | 33.61 |
+ | 0.2281 | 0.64 | 6000 | 0.2605 | 95.9172 | 91.9782 | 95.874 | 95.8638 | 33.59 |
+ | 0.2228 | 0.69 | 6500 | 0.2566 | 95.7612 | 91.7858 | 95.7129 | 95.7058 | 33.63 |
+ | 0.2202 | 0.75 | 7000 | 0.2561 | 95.9468 | 92.0914 | 95.9018 | 95.8941 | 33.64 |
+ | 0.218 | 0.8 | 7500 | 0.2579 | 95.9468 | 92.0914 | 95.9018 | 95.8941 | 33.64 |
+ | 0.2162 | 0.85 | 8000 | 0.2523 | 95.8231 | 91.9464 | 95.7727 | 95.7758 | 33.66 |
+ | 0.2135 | 0.91 | 8500 | 0.2549 | 95.8388 | 91.9804 | 95.7914 | 95.7917 | 33.63 |
+ | 0.2124 | 0.96 | 9000 | 0.2537 | 95.8905 | 91.9178 | 95.8459 | 95.8393 | 33.61 |
+
+
+ ### Framework versions
+
+ - Transformers 4.35.2
+ - Pytorch 2.1.0+cu121
+ - Datasets 2.16.0
+ - Tokenizers 0.15.0
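
Note on `lr_scheduler_type: linear` in the README above: a linear schedule typically means the learning rate decays in a straight line from its starting value to zero over the run. The following is a minimal sketch of that rule, not the Trainer's actual code. The function name `linear_lr` is hypothetical, the card does not say whether warmup was used (zero warmup is assumed here), and the total of 9375 steps is only an estimate inferred from the Epoch and Step columns of the results table.

```python
def linear_lr(step: int, total_steps: int, base_lr: float = 1e-3, warmup_steps: int = 0) -> float:
    """Learning rate under optional linear warmup followed by linear decay to zero."""
    if step < warmup_steps:
        # Ramp up from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly so the rate reaches 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# ASSUMPTION: about 9375 optimizer steps in the single epoch (an estimate, not stated in the card).
TOTAL_STEPS = 9375

if __name__ == "__main__":
    for step in (0, 500, 4500, 9000):
        print(step, linear_lr(step, TOTAL_STEPS))
```

Under these assumptions the rate at the last logged step (9000) would already be a small fraction of the initial 0.001, which may partly explain why the validation loss changes little late in the run.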
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b503e236dac22d7bb8962438655c6ceb2e69ec872572b41289987083026fefa1
+ oid sha256:9aa3a5193f112b0746f815594e358e47477fcf67e15d3ca963efb93dc7669eba
  size 18915328
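
Note on the `adapter_model.safetensors` change: the diff shows a Git LFS pointer, not the weights themselves. Only the `oid` changed while `size` stayed at 18915328, which is consistent with the adapter weights being retrained at the same shape. A minimal sketch of parsing such a pointer and checking a downloaded file against it follows; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and only the standard library is used.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data: bytes, pointer: dict) -> bool:
    """Return True if the bytes have the size and SHA-256 digest named by the pointer."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return len(data) == pointer["size"] and hashlib.sha256(data).hexdigest() == pointer["digest"]


# The new pointer from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:9aa3a5193f112b0746f815594e358e47477fcf67e15d3ca963efb93dc7669eba
size 18915328
"""
pointer = parse_lfs_pointer(POINTER)
```

To verify a real download, one would read the file's bytes and pass them to `matches_pointer` together with `pointer`.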
runs/Dec29_15-38-30_f1e6beea3151/events.out.tfevents.1703864362.f1e6beea3151.6133.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:2995dac710ce781d34c104677750499f82a503b1ca2dd6c0a58aa9e803d8f496
- size 17199
+ oid sha256:9d4c06787a718cff3f1886559bd02858287bb555cfe5164422ea7e3e94f8a2e0
+ size 17553
tokenizer.json CHANGED
@@ -1,11 +1,6 @@
  {
  "version": "1.0",
- "truncation": {
- "direction": "Right",
- "max_length": 512,
- "strategy": "LongestFirst",
- "stride": 0
- },
+ "truncation": null,
  "padding": null,
  "added_tokens": [
  {
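
Note on the `tokenizer.json` change: the commit replaces a stored truncation policy (right-side, `max_length` 512, `LongestFirst`) with `"truncation": null`. The serialized tokenizer then carries no default truncation, so any length limit presumably has to be requested at call time. A minimal sketch of inspecting this setting with the standard library follows; `describe_truncation` is a hypothetical helper, and the two JSON strings are trimmed stand-ins built from the fields visible in the diff, not the full file.

```python
import json

# Trimmed stand-ins for the old and new file contents (only fields visible in the diff).
OLD = (
    '{"version": "1.0", "truncation": {"direction": "Right", "max_length": 512, '
    '"strategy": "LongestFirst", "stride": 0}, "padding": null}'
)
NEW = '{"version": "1.0", "truncation": null, "padding": null}'


def describe_truncation(tokenizer_json: str) -> str:
    """Summarize the truncation policy stored in a tokenizer.json document."""
    config = json.loads(tokenizer_json)
    truncation = config.get("truncation")
    if truncation is None:
        # JSON null: no truncation is baked into the serialized tokenizer.
        return "no truncation"
    return (
        f"truncate to {truncation['max_length']} tokens "
        f"({truncation['direction']}, {truncation['strategy']})"
    )


if __name__ == "__main__":
    print("before:", describe_truncation(OLD))
    print("after: ", describe_truncation(NEW))
```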