End of training
Browse files- README.md +73 -7
- generation_config.json +0 -1
- model.safetensors +1 -1
- runs/May07_17-50-36_fde053a6581b/events.out.tfevents.1715104244.fde053a6581b.345.0 +3 -0
- runs/May07_17-52-42_fde053a6581b/events.out.tfevents.1715104366.fde053a6581b.345.1 +3 -0
- runs/May07_17-53-25_fde053a6581b/events.out.tfevents.1715104409.fde053a6581b.345.2 +3 -0
- runs/May07_17-55-06_fde053a6581b/events.out.tfevents.1715104509.fde053a6581b.345.3 +3 -0
- runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104582.fde053a6581b.345.4 +3 -0
- runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104692.fde053a6581b.345.5 +3 -0
- runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715105167.fde053a6581b.345.6 +3 -0
- runs/May07_18-10-34_fde053a6581b/events.out.tfevents.1715105438.fde053a6581b.345.7 +3 -0
- runs/May07_18-16-55_fde053a6581b/events.out.tfevents.1715105819.fde053a6581b.345.8 +3 -0
- runs/May07_18-18-24_fde053a6581b/events.out.tfevents.1715105907.fde053a6581b.345.9 +3 -0
- runs/May07_18-19-55_fde053a6581b/events.out.tfevents.1715106013.fde053a6581b.345.10 +3 -0
- runs/May07_18-24-33_fde053a6581b/events.out.tfevents.1715106276.fde053a6581b.345.11 +3 -0
- runs/May07_18-26-31_fde053a6581b/events.out.tfevents.1715106396.fde053a6581b.345.12 +3 -0
- runs/May07_18-27-11_fde053a6581b/events.out.tfevents.1715106434.fde053a6581b.345.13 +3 -0
- runs/May07_18-31-59_fde053a6581b/events.out.tfevents.1715106729.fde053a6581b.345.14 +3 -0
- runs/May07_18-32-36_fde053a6581b/events.out.tfevents.1715106762.fde053a6581b.345.15 +3 -0
- runs/May07_18-35-17_fde053a6581b/events.out.tfevents.1715106934.fde053a6581b.345.16 +3 -0
- runs/May07_18-35-57_fde053a6581b/events.out.tfevents.1715106965.fde053a6581b.345.17 +3 -0
- runs/May07_18-40-01_fde053a6581b/events.out.tfevents.1715107205.fde053a6581b.345.18 +3 -0
- runs/May07_18-48-56_fde053a6581b/events.out.tfevents.1715107739.fde053a6581b.345.19 +3 -0
- runs/May07_18-51-02_fde053a6581b/events.out.tfevents.1715107864.fde053a6581b.345.20 +3 -0
- runs/May07_18-57-19_fde053a6581b/events.out.tfevents.1715108244.fde053a6581b.345.21 +3 -0
- runs/May07_19-08-15_fde053a6581b/events.out.tfevents.1715108898.fde053a6581b.345.22 +3 -0
- runs/May07_19-10-26_fde053a6581b/events.out.tfevents.1715109028.fde053a6581b.345.23 +3 -0
- runs/May07_19-15-14_fde053a6581b/events.out.tfevents.1715109317.fde053a6581b.345.24 +3 -0
- runs/May07_19-24-06_fde053a6581b/events.out.tfevents.1715109848.fde053a6581b.345.25 +3 -0
- training_args.bin +2 -2
README.md
CHANGED
@@ -3,9 +3,26 @@ license: apache-2.0
|
|
3 |
base_model: google/t5-efficient-tiny
|
4 |
tags:
|
5 |
- generated_from_trainer
|
|
|
|
|
|
|
|
|
6 |
model-index:
|
7 |
- name: salt_language_ID
|
8 |
-
results:
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
---
|
10 |
|
11 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -13,7 +30,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
13 |
|
14 |
# salt_language_ID
|
15 |
|
16 |
-
This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on
|
|
|
|
|
|
|
17 |
|
18 |
## Model description
|
19 |
|
@@ -32,14 +52,60 @@ More information needed
|
|
32 |
### Training hyperparameters
|
33 |
|
34 |
The following hyperparameters were used during training:
|
35 |
-
- learning_rate: 0.
|
36 |
-
- train_batch_size:
|
37 |
-
- eval_batch_size:
|
38 |
- seed: 42
|
39 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
40 |
- lr_scheduler_type: linear
|
41 |
-
- lr_scheduler_warmup_steps:
|
42 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
43 |
|
44 |
### Framework versions
|
45 |
|
|
|
3 |
base_model: google/t5-efficient-tiny
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
+
datasets:
|
7 |
+
- generator
|
8 |
+
metrics:
|
9 |
+
- accuracy
|
10 |
model-index:
|
11 |
- name: salt_language_ID
|
12 |
+
results:
|
13 |
+
- task:
|
14 |
+
name: Sequence-to-sequence Language Modeling
|
15 |
+
type: text2text-generation
|
16 |
+
dataset:
|
17 |
+
name: generator
|
18 |
+
type: generator
|
19 |
+
config: default
|
20 |
+
split: train
|
21 |
+
args: default
|
22 |
+
metrics:
|
23 |
+
- name: Accuracy
|
24 |
+
type: accuracy
|
25 |
+
value: 0.9734543010752689
|
26 |
---
|
27 |
|
28 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
30 |
|
31 |
# salt_language_ID
|
32 |
|
33 |
+
This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on the generator dataset.
|
34 |
+
It achieves the following results on the evaluation set:
|
35 |
+
- Loss: 0.0158
|
36 |
+
- Accuracy: 0.9735
|
37 |
|
38 |
## Model description
|
39 |
|
|
|
52 |
### Training hyperparameters
|
53 |
|
54 |
The following hyperparameters were used during training:
|
55 |
+
- learning_rate: 0.001
|
56 |
+
- train_batch_size: 64
|
57 |
+
- eval_batch_size: 64
|
58 |
- seed: 42
|
59 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
60 |
- lr_scheduler_type: linear
|
61 |
+
- lr_scheduler_warmup_steps: 10
|
62 |
+
- training_steps: 20000
|
63 |
+
|
64 |
+
### Training results
|
65 |
+
|
66 |
+
| Training Loss | Epoch | Step | Validation Loss | Accuracy |
|
67 |
+
|:-------------:|:-----:|:-----:|:---------------:|:--------:|
|
68 |
+
| 0.5256 | 0.025 | 500 | 0.1505 | 0.7698 |
|
69 |
+
| 0.0708 | 0.05 | 1000 | 0.0447 | 0.9237 |
|
70 |
+
| 0.0547 | 0.075 | 1500 | 0.0540 | 0.9093 |
|
71 |
+
| 0.0476 | 0.1 | 2000 | 0.0428 | 0.9264 |
|
72 |
+
| 0.0413 | 0.125 | 2500 | 0.0334 | 0.9399 |
|
73 |
+
| 0.0404 | 0.15 | 3000 | 0.0293 | 0.9479 |
|
74 |
+
| 0.0374 | 0.175 | 3500 | 0.0324 | 0.9459 |
|
75 |
+
| 0.0359 | 0.2 | 4000 | 0.0257 | 0.9493 |
|
76 |
+
| 0.0353 | 0.225 | 4500 | 0.0267 | 0.9513 |
|
77 |
+
| 0.0336 | 0.25 | 5000 | 0.0234 | 0.9587 |
|
78 |
+
| 0.0337 | 0.275 | 5500 | 0.0253 | 0.9560 |
|
79 |
+
| 0.0324 | 0.3 | 6000 | 0.0186 | 0.9684 |
|
80 |
+
| 0.0307 | 0.325 | 6500 | 0.0208 | 0.9634 |
|
81 |
+
| 0.028 | 0.35 | 7000 | 0.0253 | 0.9573 |
|
82 |
+
| 0.0297 | 0.375 | 7500 | 0.0224 | 0.9617 |
|
83 |
+
| 0.0264 | 0.4 | 8000 | 0.0224 | 0.9607 |
|
84 |
+
| 0.027 | 0.425 | 8500 | 0.0185 | 0.9667 |
|
85 |
+
| 0.0266 | 0.45 | 9000 | 0.0222 | 0.9634 |
|
86 |
+
| 0.0259 | 0.475 | 9500 | 0.0221 | 0.9617 |
|
87 |
+
| 0.0244 | 0.5 | 10000 | 0.0187 | 0.9688 |
|
88 |
+
| 0.0243 | 0.525 | 10500 | 0.0164 | 0.9694 |
|
89 |
+
| 0.0248 | 0.55 | 11000 | 0.0184 | 0.9674 |
|
90 |
+
| 0.024 | 0.575 | 11500 | 0.0155 | 0.9704 |
|
91 |
+
| 0.0228 | 0.6 | 12000 | 0.0176 | 0.9671 |
|
92 |
+
| 0.0241 | 0.625 | 12500 | 0.0146 | 0.9755 |
|
93 |
+
| 0.0234 | 0.65 | 13000 | 0.0181 | 0.9681 |
|
94 |
+
| 0.0226 | 0.675 | 13500 | 0.0142 | 0.9758 |
|
95 |
+
| 0.0225 | 0.7 | 14000 | 0.0169 | 0.9718 |
|
96 |
+
| 0.0218 | 0.725 | 14500 | 0.0151 | 0.9711 |
|
97 |
+
| 0.0212 | 0.75 | 15000 | 0.0176 | 0.9735 |
|
98 |
+
| 0.0199 | 0.775 | 15500 | 0.0131 | 0.9741 |
|
99 |
+
| 0.0208 | 0.8 | 16000 | 0.0131 | 0.9775 |
|
100 |
+
| 0.0217 | 0.825 | 16500 | 0.0123 | 0.9788 |
|
101 |
+
| 0.0208 | 0.85 | 17000 | 0.0145 | 0.9758 |
|
102 |
+
| 0.0217 | 0.875 | 17500 | 0.0154 | 0.9694 |
|
103 |
+
| 0.0197 | 0.9 | 18000 | 0.0138 | 0.9765 |
|
104 |
+
| 0.0205 | 0.925 | 18500 | 0.0138 | 0.9748 |
|
105 |
+
| 0.0203 | 0.95 | 19000 | 0.0146 | 0.9748 |
|
106 |
+
| 0.0198 | 0.975 | 19500 | 0.0131 | 0.9755 |
|
107 |
+
| 0.0204 | 1.0 | 20000 | 0.0158 | 0.9735 |
|
108 |
+
|
109 |
|
110 |
### Framework versions
|
111 |
|
generation_config.json
CHANGED
@@ -1,5 +1,4 @@
|
|
1 |
{
|
2 |
-
"_from_model_config": true,
|
3 |
"decoder_start_token_id": 0,
|
4 |
"eos_token_id": 1,
|
5 |
"pad_token_id": 0,
|
|
|
1 |
{
|
|
|
2 |
"decoder_start_token_id": 0,
|
3 |
"eos_token_id": 1,
|
4 |
"pad_token_id": 0,
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 62293080
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:788da342e2c62b25dd9ab59ef2893286998723719f9989a383aa107f4615b14e
|
3 |
size 62293080
|
runs/May07_17-50-36_fde053a6581b/events.out.tfevents.1715104244.fde053a6581b.345.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f77399b0934141db6e108df411e9b53a6fe84ee08ed1166ed0091f0aa35267a7
|
3 |
+
size 5236
|
runs/May07_17-52-42_fde053a6581b/events.out.tfevents.1715104366.fde053a6581b.345.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2b78947329c02196f04e6eb9c83286fc2e24050458f0c21bf50997ea91dfc0ab
|
3 |
+
size 5236
|
runs/May07_17-53-25_fde053a6581b/events.out.tfevents.1715104409.fde053a6581b.345.2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:57aec9061f9288e10ba01afbb46b38454414b49d6fdd3e4d2e151dab77142601
|
3 |
+
size 5236
|
runs/May07_17-55-06_fde053a6581b/events.out.tfevents.1715104509.fde053a6581b.345.3
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:74f89df7ab198fc008851fc6c6b29e1a6bddea9bcae0dc8169288c567aed4ecf
|
3 |
+
size 5236
|
runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104582.fde053a6581b.345.4
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b6ca5f0ca7f15d2699ab081e52682f91ed2b5588bdbd7d5e1e8cb2dcbb94ceb4
|
3 |
+
size 5236
|
runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715104692.fde053a6581b.345.5
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7d5f3005f1691e90b42e7a602ea26bee191212339fe3b321ca0d0c29e3869256
|
3 |
+
size 5236
|
runs/May07_17-56-19_fde053a6581b/events.out.tfevents.1715105167.fde053a6581b.345.6
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:935cb62c15af6c62aceb2c6373f3497c322c2e387c53de1aaa2ca42cdd2f769d
|
3 |
+
size 5236
|
runs/May07_18-10-34_fde053a6581b/events.out.tfevents.1715105438.fde053a6581b.345.7
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:58a95c1955d5f99452ba02169f766a67e6c09752953e69989eea80dba05026f5
|
3 |
+
size 5235
|
runs/May07_18-16-55_fde053a6581b/events.out.tfevents.1715105819.fde053a6581b.345.8
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2d99c365cfc56ae8239e55992205c18504bf1f79bdc42dc8dddd610487cc8b1d
|
3 |
+
size 5235
|
runs/May07_18-18-24_fde053a6581b/events.out.tfevents.1715105907.fde053a6581b.345.9
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:006641656fefbd49debafddf7630154a8d4ca5b1717b2e44087538b443c4f1aa
|
3 |
+
size 6076
|
runs/May07_18-19-55_fde053a6581b/events.out.tfevents.1715106013.fde053a6581b.345.10
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ae0b3fedcdad2e09aff9b73e082fd461b847ebacdd11fda785b4e2ba052ec8d1
|
3 |
+
size 6807
|
runs/May07_18-24-33_fde053a6581b/events.out.tfevents.1715106276.fde053a6581b.345.11
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d4c277a0ea3987230ee348221d905699af0268c1d9bb7f94f7619e48223a29cf
|
3 |
+
size 5758
|
runs/May07_18-26-31_fde053a6581b/events.out.tfevents.1715106396.fde053a6581b.345.12
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0730326bbf6146c2d120aa6216b869dc808cf402e3f53122bf86e3bb7fb3a794
|
3 |
+
size 5234
|
runs/May07_18-27-11_fde053a6581b/events.out.tfevents.1715106434.fde053a6581b.345.13
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:19bd27706c0a732a87e310f9a139d3312fef68c940a200fda35df90d2d7812f8
|
3 |
+
size 5567
|
runs/May07_18-31-59_fde053a6581b/events.out.tfevents.1715106729.fde053a6581b.345.14
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5cd44a1410d3688001c75d19f0b255aa1107fd44482b84c254619999fce42e76
|
3 |
+
size 5031
|
runs/May07_18-32-36_fde053a6581b/events.out.tfevents.1715106762.fde053a6581b.345.15
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d3f8674f3f4922b0ac0510ba27463127124497a8d12c587e121d8c9f345aacec
|
3 |
+
size 5553
|
runs/May07_18-35-17_fde053a6581b/events.out.tfevents.1715106934.fde053a6581b.345.16
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a18ddbf4edd29957dfb08584098a0f3c27ebabd4693c0acb2e994fa1567719ad
|
3 |
+
size 346
|
runs/May07_18-35-57_fde053a6581b/events.out.tfevents.1715106965.fde053a6581b.345.17
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:843b1115111d84ab6f8331b6158edcaae1e357d9c11ca7784f0467d7eed91e4b
|
3 |
+
size 346
|
runs/May07_18-40-01_fde053a6581b/events.out.tfevents.1715107205.fde053a6581b.345.18
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:69dd9f16c29918ea695844152eb877d0604decd3d064d1ba1b0e25454d3d808e
|
3 |
+
size 8221
|
runs/May07_18-48-56_fde053a6581b/events.out.tfevents.1715107739.fde053a6581b.345.19
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ab900a0c0019fe65ee20b8969ff51f94c6c7facd4d80bf6dc55fe8954f5e8381
|
3 |
+
size 5551
|
runs/May07_18-51-02_fde053a6581b/events.out.tfevents.1715107864.fde053a6581b.345.20
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e8e3c5433311e7b35369ed12ea0d54891c7ab86a9b916284e0d4227fdaf55d13
|
3 |
+
size 7165
|
runs/May07_18-57-19_fde053a6581b/events.out.tfevents.1715108244.fde053a6581b.345.21
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6971080560cc53ecad1f08f506e1ca8b3e6fe94bf9d1b88ee9b300c04653b3e9
|
3 |
+
size 6096
|
runs/May07_19-08-15_fde053a6581b/events.out.tfevents.1715108898.fde053a6581b.345.22
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9b7f1128ce214d7df1220b18b24352b423c13096bdf2bc660c81941a125eb917
|
3 |
+
size 6630
|
runs/May07_19-10-26_fde053a6581b/events.out.tfevents.1715109028.fde053a6581b.345.23
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:406a51e1b5d58e86bde34c5b973577160a08de936d72b42300315a3f445a1403
|
3 |
+
size 5560
|
runs/May07_19-15-14_fde053a6581b/events.out.tfevents.1715109317.fde053a6581b.345.24
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:aeb1cbf59f1d26a11fc1b1190a98e39f49924282332affb7633b44774a03b1ee
|
3 |
+
size 10366
|
runs/May07_19-24-06_fde053a6581b/events.out.tfevents.1715109848.fde053a6581b.345.25
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9eb6a278a8f4853d96074acb2ad434714923a6254c2e1d45e0c646dbc3a7fb60
|
3 |
+
size 26826
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
-
size
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:57f28fd303968b531e3a369c31229f95178865b23bbf46d2e68a9c8a5b320757
|
3 |
+
size 5176
|