End of training
Browse files- README.md +14 -9
- model.safetensors +1 -1
- runs/Sep17_19-36-17_ip-10-192-12-112/events.out.tfevents.1726601782.ip-10-192-12-112.1319.18 +3 -0
- runs/Sep17_19-38-51_ip-10-192-12-112/events.out.tfevents.1726601939.ip-10-192-12-112.1319.19 +3 -0
- runs/Sep22_09-15-13_ip-10-192-12-209/events.out.tfevents.1726996524.ip-10-192-12-209.1556.0 +3 -0
- runs/Sep22_09-16-22_ip-10-192-12-209/events.out.tfevents.1726996587.ip-10-192-12-209.1556.1 +3 -0
- runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726996817.ip-10-192-12-209.6211.0 +3 -0
- runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726997064.ip-10-192-12-209.6211.1 +3 -0
- runs/Sep22_09-30-50_ip-10-192-12-209/events.out.tfevents.1726997458.ip-10-192-12-209.15554.0 +3 -0
- runs/Sep22_10-00-02_ip-10-192-12-209/events.out.tfevents.1726999203.ip-10-192-12-209.43955.0 +3 -0
- tokenizer.json +6 -1
- training_args.bin +1 -1
README.md
CHANGED
@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
|
19 |
This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
-
- Loss:
|
22 |
-
- Bleu: 0.
|
23 |
-
- Gen Len: 16.
|
24 |
|
25 |
## Model description
|
26 |
|
@@ -45,18 +45,23 @@ The following hyperparameters were used during training:
|
|
45 |
- seed: 42
|
46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
47 |
- lr_scheduler_type: linear
|
48 |
-
- num_epochs:
|
49 |
- mixed_precision_training: Native AMP
|
50 |
|
51 |
### Training results
|
52 |
|
53 |
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|
54 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
|
55 |
-
| 4.
|
56 |
-
| 4.
|
57 |
-
| 4.
|
58 |
-
| 4.
|
59 |
-
| 4.
|
|
|
|
|
|
|
|
|
|
|
60 |
|
61 |
|
62 |
### Framework versions
|
|
|
18 |
|
19 |
This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
|
20 |
It achieves the following results on the evaluation set:
|
21 |
+
- Loss: 3.9119
|
22 |
+
- Bleu: 0.1206
|
23 |
+
- Gen Len: 16.6875
|
24 |
|
25 |
## Model description
|
26 |
|
|
|
45 |
- seed: 42
|
46 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
47 |
- lr_scheduler_type: linear
|
48 |
+
- num_epochs: 10
|
49 |
- mixed_precision_training: Native AMP
|
50 |
|
51 |
### Training results
|
52 |
|
53 |
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|
54 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
|
55 |
+
| 4.9376 | 1.0 | 144 | 4.3320 | 0.1122 | 16.9514 |
|
56 |
+
| 4.5062 | 2.0 | 288 | 4.1559 | 0.1191 | 15.9653 |
|
57 |
+
| 4.3624 | 3.0 | 432 | 4.0699 | 0.1118 | 16.7361 |
|
58 |
+
| 4.2832 | 4.0 | 576 | 4.0222 | 0.1474 | 16.3125 |
|
59 |
+
| 4.2286 | 5.0 | 720 | 3.9854 | 0.1421 | 16.2778 |
|
60 |
+
| 4.1762 | 6.0 | 864 | 3.9550 | 0.1107 | 16.9583 |
|
61 |
+
| 4.14 | 7.0 | 1008 | 3.9386 | 0.1215 | 16.1944 |
|
62 |
+
| 4.087 | 8.0 | 1152 | 3.9221 | 0.1232 | 16.7153 |
|
63 |
+
| 4.0988 | 9.0 | 1296 | 3.9154 | 0.1206 | 16.6875 |
|
64 |
+
| 4.0738 | 10.0 | 1440 | 3.9119 | 0.1206 | 16.6875 |
|
65 |
|
66 |
|
67 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 242041896
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:74018eca5257836ae7351a0ca339b40176188456b0943543e4e6dcd6c66336f9
|
3 |
size 242041896
|
runs/Sep17_19-36-17_ip-10-192-12-112/events.out.tfevents.1726601782.ip-10-192-12-112.1319.18
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a3dc4c6a07078d557dca62396e88d8b84360096651ed1606852dd9ca499c1b21
|
3 |
+
size 9208
|
runs/Sep17_19-38-51_ip-10-192-12-112/events.out.tfevents.1726601939.ip-10-192-12-112.1319.19
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:cd6462613bd8080f6d13de659383a8d395a5e6618127b3d4987a415fc50d0688
|
3 |
+
size 6884
|
runs/Sep22_09-15-13_ip-10-192-12-209/events.out.tfevents.1726996524.ip-10-192-12-209.1556.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:37ea71638cbcb010b9342e571eee1ab1b69a930e7b9b1f46091420917b6a8a37
|
3 |
+
size 6884
|
runs/Sep22_09-16-22_ip-10-192-12-209/events.out.tfevents.1726996587.ip-10-192-12-209.1556.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9d59cfa20037380d85a02e2e6aeb6a82739ecd05135ea78c77ca67fd822729d4
|
3 |
+
size 9208
|
runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726996817.ip-10-192-12-209.6211.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b7581d39ec1a1f77e3900c64f120278ef60b66bdba2d417d81d395344f7e4246
|
3 |
+
size 9210
|
runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726997064.ip-10-192-12-209.6211.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c0a7508158d184765b0a7d418e910af45a218820036289dcf735d10595c9c82d
|
3 |
+
size 9238
|
runs/Sep22_09-30-50_ip-10-192-12-209/events.out.tfevents.1726997458.ip-10-192-12-209.15554.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:60e748fd26f91e4a4b8880b90a5e266dc4c45908fe1b9cf0ae90400be15b6508
|
3 |
+
size 9251
|
runs/Sep22_10-00-02_ip-10-192-12-209/events.out.tfevents.1726999203.ip-10-192-12-209.43955.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:69a0c83f69fd3b6005422e897d5f71414e4070e74028c54757e56ec719831960
|
3 |
+
size 12114
|
tokenizer.json
CHANGED
@@ -1,6 +1,11 @@
|
|
1 |
{
|
2 |
"version": "1.0",
|
3 |
-
"truncation":
|
|
|
|
|
|
|
|
|
|
|
4 |
"padding": null,
|
5 |
"added_tokens": [
|
6 |
{
|
|
|
1 |
{
|
2 |
"version": "1.0",
|
3 |
+
"truncation": {
|
4 |
+
"direction": "Right",
|
5 |
+
"max_length": 128,
|
6 |
+
"strategy": "LongestFirst",
|
7 |
+
"stride": 0
|
8 |
+
},
|
9 |
"padding": null,
|
10 |
"added_tokens": [
|
11 |
{
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5368
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d98df7c46aa141f4b035a0b5676c4b4bae82cec1f793a6427a332f37b21643e5
|
3 |
size 5368
|