jq commited on
Commit
4c5361d
1 Parent(s): 84f38ce

End of training

Browse files
Files changed (30) hide show
  1. README.md +73 -7
  2. generation_config.json +0 -1
  3. model.safetensors +1 -1
  4. runs/May07_17-50-36_fde053a6581b/events.out.tfevents.1715104244.fde053a6581b.345.0 +3 -0
  5. runs/May07_17-52-42_fde053a6581b/events.out.tfevents.1715104366.fde053a6581b.345.1 +3 -0
  6. runs/May07_17-53-25_fde053a6581b/events.out.tfevents.1715104409.fde053a6581b.345.2 +3 -0
  7. runs/May07_17-55-06_fde053a6581b/events.out.tfevents.1715104509.fde053a6581b.345.3 +3 -0
  8. runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104582.fde053a6581b.345.4 +3 -0
  9. runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104692.fde053a6581b.345.5 +3 -0
  10. runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715105167.fde053a6581b.345.6 +3 -0
  11. runs/May07_18-10-34_fde053a6581b/events.out.tfevents.1715105438.fde053a6581b.345.7 +3 -0
  12. runs/May07_18-16-55_fde053a6581b/events.out.tfevents.1715105819.fde053a6581b.345.8 +3 -0
  13. runs/May07_18-18-24_fde053a6581b/events.out.tfevents.1715105907.fde053a6581b.345.9 +3 -0
  14. runs/May07_18-19-55_fde053a6581b/events.out.tfevents.1715106013.fde053a6581b.345.10 +3 -0
  15. runs/May07_18-24-33_fde053a6581b/events.out.tfevents.1715106276.fde053a6581b.345.11 +3 -0
  16. runs/May07_18-26-31_fde053a6581b/events.out.tfevents.1715106396.fde053a6581b.345.12 +3 -0
  17. runs/May07_18-27-11_fde053a6581b/events.out.tfevents.1715106434.fde053a6581b.345.13 +3 -0
  18. runs/May07_18-31-59_fde053a6581b/events.out.tfevents.1715106729.fde053a6581b.345.14 +3 -0
  19. runs/May07_18-32-36_fde053a6581b/events.out.tfevents.1715106762.fde053a6581b.345.15 +3 -0
  20. runs/May07_18-35-17_fde053a6581b/events.out.tfevents.1715106934.fde053a6581b.345.16 +3 -0
  21. runs/May07_18-35-57_fde053a6581b/events.out.tfevents.1715106965.fde053a6581b.345.17 +3 -0
  22. runs/May07_18-40-01_fde053a6581b/events.out.tfevents.1715107205.fde053a6581b.345.18 +3 -0
  23. runs/May07_18-48-56_fde053a6581b/events.out.tfevents.1715107739.fde053a6581b.345.19 +3 -0
  24. runs/May07_18-51-02_fde053a6581b/events.out.tfevents.1715107864.fde053a6581b.345.20 +3 -0
  25. runs/May07_18-57-19_fde053a6581b/events.out.tfevents.1715108244.fde053a6581b.345.21 +3 -0
  26. runs/May07_19-08-15_fde053a6581b/events.out.tfevents.1715108898.fde053a6581b.345.22 +3 -0
  27. runs/May07_19-10-26_fde053a6581b/events.out.tfevents.1715109028.fde053a6581b.345.23 +3 -0
  28. runs/May07_19-15-14_fde053a6581b/events.out.tfevents.1715109317.fde053a6581b.345.24 +3 -0
  29. runs/May07_19-24-06_fde053a6581b/events.out.tfevents.1715109848.fde053a6581b.345.25 +3 -0
  30. training_args.bin +2 -2
README.md CHANGED
@@ -3,9 +3,26 @@ license: apache-2.0
3
  base_model: google/t5-efficient-tiny
4
  tags:
5
  - generated_from_trainer
 
 
 
 
6
  model-index:
7
  - name: salt_language_ID
8
- results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
9
  ---
10
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -13,7 +30,10 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # salt_language_ID
15
 
16
- This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on an unknown dataset.
 
 
 
17
 
18
  ## Model description
19
 
@@ -32,14 +52,60 @@ More information needed
32
  ### Training hyperparameters
33
 
34
  The following hyperparameters were used during training:
35
- - learning_rate: 0.0003
36
- - train_batch_size: 1024
37
- - eval_batch_size: 1024
38
  - seed: 42
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
- - lr_scheduler_warmup_steps: 500
42
- - num_epochs: 1
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
43
 
44
  ### Framework versions
45
 
 
3
  base_model: google/t5-efficient-tiny
4
  tags:
5
  - generated_from_trainer
6
+ datasets:
7
+ - generator
8
+ metrics:
9
+ - accuracy
10
  model-index:
11
  - name: salt_language_ID
12
+ results:
13
+ - task:
14
+ name: Sequence-to-sequence Language Modeling
15
+ type: text2text-generation
16
+ dataset:
17
+ name: generator
18
+ type: generator
19
+ config: default
20
+ split: train
21
+ args: default
22
+ metrics:
23
+ - name: Accuracy
24
+ type: accuracy
25
+ value: 0.9734543010752689
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
30
 
31
  # salt_language_ID
32
 
33
+ This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on the generator dataset.
34
+ It achieves the following results on the evaluation set:
35
+ - Loss: 0.0158
36
+ - Accuracy: 0.9735
37
 
38
  ## Model description
39
 
 
52
  ### Training hyperparameters
53
 
54
  The following hyperparameters were used during training:
55
+ - learning_rate: 0.001
56
+ - train_batch_size: 64
57
+ - eval_batch_size: 64
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
+ - lr_scheduler_warmup_steps: 10
62
+ - training_steps: 20000
63
+
64
+ ### Training results
65
+
66
+ | Training Loss | Epoch | Step | Validation Loss | Accuracy |
67
+ |:-------------:|:-----:|:-----:|:---------------:|:--------:|
68
+ | 0.5256 | 0.025 | 500 | 0.1505 | 0.7698 |
69
+ | 0.0708 | 0.05 | 1000 | 0.0447 | 0.9237 |
70
+ | 0.0547 | 0.075 | 1500 | 0.0540 | 0.9093 |
71
+ | 0.0476 | 0.1 | 2000 | 0.0428 | 0.9264 |
72
+ | 0.0413 | 0.125 | 2500 | 0.0334 | 0.9399 |
73
+ | 0.0404 | 0.15 | 3000 | 0.0293 | 0.9479 |
74
+ | 0.0374 | 0.175 | 3500 | 0.0324 | 0.9459 |
75
+ | 0.0359 | 0.2 | 4000 | 0.0257 | 0.9493 |
76
+ | 0.0353 | 0.225 | 4500 | 0.0267 | 0.9513 |
77
+ | 0.0336 | 0.25 | 5000 | 0.0234 | 0.9587 |
78
+ | 0.0337 | 0.275 | 5500 | 0.0253 | 0.9560 |
79
+ | 0.0324 | 0.3 | 6000 | 0.0186 | 0.9684 |
80
+ | 0.0307 | 0.325 | 6500 | 0.0208 | 0.9634 |
81
+ | 0.028 | 0.35 | 7000 | 0.0253 | 0.9573 |
82
+ | 0.0297 | 0.375 | 7500 | 0.0224 | 0.9617 |
83
+ | 0.0264 | 0.4 | 8000 | 0.0224 | 0.9607 |
84
+ | 0.027 | 0.425 | 8500 | 0.0185 | 0.9667 |
85
+ | 0.0266 | 0.45 | 9000 | 0.0222 | 0.9634 |
86
+ | 0.0259 | 0.475 | 9500 | 0.0221 | 0.9617 |
87
+ | 0.0244 | 0.5 | 10000 | 0.0187 | 0.9688 |
88
+ | 0.0243 | 0.525 | 10500 | 0.0164 | 0.9694 |
89
+ | 0.0248 | 0.55 | 11000 | 0.0184 | 0.9674 |
90
+ | 0.024 | 0.575 | 11500 | 0.0155 | 0.9704 |
91
+ | 0.0228 | 0.6 | 12000 | 0.0176 | 0.9671 |
92
+ | 0.0241 | 0.625 | 12500 | 0.0146 | 0.9755 |
93
+ | 0.0234 | 0.65 | 13000 | 0.0181 | 0.9681 |
94
+ | 0.0226 | 0.675 | 13500 | 0.0142 | 0.9758 |
95
+ | 0.0225 | 0.7 | 14000 | 0.0169 | 0.9718 |
96
+ | 0.0218 | 0.725 | 14500 | 0.0151 | 0.9711 |
97
+ | 0.0212 | 0.75 | 15000 | 0.0176 | 0.9735 |
98
+ | 0.0199 | 0.775 | 15500 | 0.0131 | 0.9741 |
99
+ | 0.0208 | 0.8 | 16000 | 0.0131 | 0.9775 |
100
+ | 0.0217 | 0.825 | 16500 | 0.0123 | 0.9788 |
101
+ | 0.0208 | 0.85 | 17000 | 0.0145 | 0.9758 |
102
+ | 0.0217 | 0.875 | 17500 | 0.0154 | 0.9694 |
103
+ | 0.0197 | 0.9 | 18000 | 0.0138 | 0.9765 |
104
+ | 0.0205 | 0.925 | 18500 | 0.0138 | 0.9748 |
105
+ | 0.0203 | 0.95 | 19000 | 0.0146 | 0.9748 |
106
+ | 0.0198 | 0.975 | 19500 | 0.0131 | 0.9755 |
107
+ | 0.0204 | 1.0 | 20000 | 0.0158 | 0.9735 |
108
+
109
 
110
  ### Framework versions
111
 
generation_config.json CHANGED
@@ -1,5 +1,4 @@
1
  {
2
- "_from_model_config": true,
3
  "decoder_start_token_id": 0,
4
  "eos_token_id": 1,
5
  "pad_token_id": 0,
 
1
  {
 
2
  "decoder_start_token_id": 0,
3
  "eos_token_id": 1,
4
  "pad_token_id": 0,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:39448328f244ad703ec202bb8c16e31adce5d6001067d8c8102f671c85a3f6fc
3
  size 62293080
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:788da342e2c62b25dd9ab59ef2893286998723719f9989a383aa107f4615b14e
3
  size 62293080
runs/May07_17-50-36_fde053a6581b/events.out.tfevents.1715104244.fde053a6581b.345.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f77399b0934141db6e108df411e9b53a6fe84ee08ed1166ed0091f0aa35267a7
3
+ size 5236
runs/May07_17-52-42_fde053a6581b/events.out.tfevents.1715104366.fde053a6581b.345.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2b78947329c02196f04e6eb9c83286fc2e24050458f0c21bf50997ea91dfc0ab
3
+ size 5236
runs/May07_17-53-25_fde053a6581b/events.out.tfevents.1715104409.fde053a6581b.345.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:57aec9061f9288e10ba01afbb46b38454414b49d6fdd3e4d2e151dab77142601
3
+ size 5236
runs/May07_17-55-06_fde053a6581b/events.out.tfevents.1715104509.fde053a6581b.345.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:74f89df7ab198fc008851fc6c6b29e1a6bddea9bcae0dc8169288c567aed4ecf
3
+ size 5236
runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104582.fde053a6581b.345.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b6ca5f0ca7f15d2699ab081e52682f91ed2b5588bdbd7d5e1e8cb2dcbb94ceb4
3
+ size 5236
runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104692.fde053a6581b.345.5 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7d5f3005f1691e90b42e7a602ea26bee191212339fe3b321ca0d0c29e3869256
3
+ size 5236
runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715105167.fde053a6581b.345.6 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:935cb62c15af6c62aceb2c6373f3497c322c2e387c53de1aaa2ca42cdd2f769d
3
+ size 5236
runs/May07_18-10-34_fde053a6581b/events.out.tfevents.1715105438.fde053a6581b.345.7 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:58a95c1955d5f99452ba02169f766a67e6c09752953e69989eea80dba05026f5
3
+ size 5235
runs/May07_18-16-55_fde053a6581b/events.out.tfevents.1715105819.fde053a6581b.345.8 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2d99c365cfc56ae8239e55992205c18504bf1f79bdc42dc8dddd610487cc8b1d
3
+ size 5235
runs/May07_18-18-24_fde053a6581b/events.out.tfevents.1715105907.fde053a6581b.345.9 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:006641656fefbd49debafddf7630154a8d4ca5b1717b2e44087538b443c4f1aa
3
+ size 6076
runs/May07_18-19-55_fde053a6581b/events.out.tfevents.1715106013.fde053a6581b.345.10 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ae0b3fedcdad2e09aff9b73e082fd461b847ebacdd11fda785b4e2ba052ec8d1
3
+ size 6807
runs/May07_18-24-33_fde053a6581b/events.out.tfevents.1715106276.fde053a6581b.345.11 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d4c277a0ea3987230ee348221d905699af0268c1d9bb7f94f7619e48223a29cf
3
+ size 5758
runs/May07_18-26-31_fde053a6581b/events.out.tfevents.1715106396.fde053a6581b.345.12 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0730326bbf6146c2d120aa6216b869dc808cf402e3f53122bf86e3bb7fb3a794
3
+ size 5234
runs/May07_18-27-11_fde053a6581b/events.out.tfevents.1715106434.fde053a6581b.345.13 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19bd27706c0a732a87e310f9a139d3312fef68c940a200fda35df90d2d7812f8
3
+ size 5567
runs/May07_18-31-59_fde053a6581b/events.out.tfevents.1715106729.fde053a6581b.345.14 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5cd44a1410d3688001c75d19f0b255aa1107fd44482b84c254619999fce42e76
3
+ size 5031
runs/May07_18-32-36_fde053a6581b/events.out.tfevents.1715106762.fde053a6581b.345.15 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d3f8674f3f4922b0ac0510ba27463127124497a8d12c587e121d8c9f345aacec
3
+ size 5553
runs/May07_18-35-17_fde053a6581b/events.out.tfevents.1715106934.fde053a6581b.345.16 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a18ddbf4edd29957dfb08584098a0f3c27ebabd4693c0acb2e994fa1567719ad
3
+ size 346
runs/May07_18-35-57_fde053a6581b/events.out.tfevents.1715106965.fde053a6581b.345.17 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:843b1115111d84ab6f8331b6158edcaae1e357d9c11ca7784f0467d7eed91e4b
3
+ size 346
runs/May07_18-40-01_fde053a6581b/events.out.tfevents.1715107205.fde053a6581b.345.18 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:69dd9f16c29918ea695844152eb877d0604decd3d064d1ba1b0e25454d3d808e
3
+ size 8221
runs/May07_18-48-56_fde053a6581b/events.out.tfevents.1715107739.fde053a6581b.345.19 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ab900a0c0019fe65ee20b8969ff51f94c6c7facd4d80bf6dc55fe8954f5e8381
3
+ size 5551
runs/May07_18-51-02_fde053a6581b/events.out.tfevents.1715107864.fde053a6581b.345.20 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e8e3c5433311e7b35369ed12ea0d54891c7ab86a9b916284e0d4227fdaf55d13
3
+ size 7165
runs/May07_18-57-19_fde053a6581b/events.out.tfevents.1715108244.fde053a6581b.345.21 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6971080560cc53ecad1f08f506e1ca8b3e6fe94bf9d1b88ee9b300c04653b3e9
3
+ size 6096
runs/May07_19-08-15_fde053a6581b/events.out.tfevents.1715108898.fde053a6581b.345.22 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9b7f1128ce214d7df1220b18b24352b423c13096bdf2bc660c81941a125eb917
3
+ size 6630
runs/May07_19-10-26_fde053a6581b/events.out.tfevents.1715109028.fde053a6581b.345.23 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:406a51e1b5d58e86bde34c5b973577160a08de936d72b42300315a3f445a1403
3
+ size 5560
runs/May07_19-15-14_fde053a6581b/events.out.tfevents.1715109317.fde053a6581b.345.24 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:aeb1cbf59f1d26a11fc1b1190a98e39f49924282332affb7633b44774a03b1ee
3
+ size 10366
runs/May07_19-24-06_fde053a6581b/events.out.tfevents.1715109848.fde053a6581b.345.25 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9eb6a278a8f4853d96074acb2ad434714923a6254c2e1d45e0c646dbc3a7fb60
3
+ size 26826
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:9d0a7824b28d408f1268ebf4643da549dc93a6096da96ee275b33e21a13be40a
3
- size 5112
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:57f28fd303968b531e3a369c31229f95178865b23bbf46d2e68a9c8a5b320757
3
+ size 5176