End of training

Files changed (12) hide show

README.md CHANGED Viewed

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.1552
-- Bleu: 0.0813
-- Gen Len: 16.4792
 ## Model description
@@ -45,18 +45,23 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
-| 4.9485        | 1.0   | 144  | 4.4460          | 0.0    | 16.875  |
-| 4.515         | 2.0   | 288  | 4.2735          | 0.0    | 16.625  |
-| 4.3579        | 3.0   | 432  | 4.1977          | 0.0    | 16.7014 |
-| 4.3095        | 4.0   | 576  | 4.1644          | 0.0818 | 16.5417 |
-| 4.2744        | 5.0   | 720  | 4.1552          | 0.0813 | 16.4792 |
 ### Framework versions

 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.9119
+- Bleu: 0.1206
+- Gen Len: 16.6875
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
+| 4.9376        | 1.0   | 144  | 4.3320          | 0.1122 | 16.9514 |
+| 4.5062        | 2.0   | 288  | 4.1559          | 0.1191 | 15.9653 |
+| 4.3624        | 3.0   | 432  | 4.0699          | 0.1118 | 16.7361 |
+| 4.2832        | 4.0   | 576  | 4.0222          | 0.1474 | 16.3125 |
+| 4.2286        | 5.0   | 720  | 3.9854          | 0.1421 | 16.2778 |
+| 4.1762        | 6.0   | 864  | 3.9550          | 0.1107 | 16.9583 |
+| 4.14          | 7.0   | 1008 | 3.9386          | 0.1215 | 16.1944 |
+| 4.087         | 8.0   | 1152 | 3.9221          | 0.1232 | 16.7153 |
+| 4.0988        | 9.0   | 1296 | 3.9154          | 0.1206 | 16.6875 |
+| 4.0738        | 10.0  | 1440 | 3.9119          | 0.1206 | 16.6875 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ef0a51adef99bb0c335a98548f2911f9d3bd1b2bb9d0b40953248c994da60fc1
 size 242041896

 version https://git-lfs.github.com/spec/v1
+oid sha256:74018eca5257836ae7351a0ca339b40176188456b0943543e4e6dcd6c66336f9
 size 242041896

runs/Sep17_19-36-17_ip-10-192-12-112/events.out.tfevents.1726601782.ip-10-192-12-112.1319.18 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:a3dc4c6a07078d557dca62396e88d8b84360096651ed1606852dd9ca499c1b21
+size 9208

runs/Sep17_19-38-51_ip-10-192-12-112/events.out.tfevents.1726601939.ip-10-192-12-112.1319.19 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:cd6462613bd8080f6d13de659383a8d395a5e6618127b3d4987a415fc50d0688
+size 6884

runs/Sep22_09-15-13_ip-10-192-12-209/events.out.tfevents.1726996524.ip-10-192-12-209.1556.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:37ea71638cbcb010b9342e571eee1ab1b69a930e7b9b1f46091420917b6a8a37
+size 6884

runs/Sep22_09-16-22_ip-10-192-12-209/events.out.tfevents.1726996587.ip-10-192-12-209.1556.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:9d59cfa20037380d85a02e2e6aeb6a82739ecd05135ea78c77ca67fd822729d4
+size 9208

runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726996817.ip-10-192-12-209.6211.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:b7581d39ec1a1f77e3900c64f120278ef60b66bdba2d417d81d395344f7e4246
+size 9210

runs/Sep22_09-20-10_ip-10-192-12-209/events.out.tfevents.1726997064.ip-10-192-12-209.6211.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:c0a7508158d184765b0a7d418e910af45a218820036289dcf735d10595c9c82d
+size 9238

runs/Sep22_09-30-50_ip-10-192-12-209/events.out.tfevents.1726997458.ip-10-192-12-209.15554.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:60e748fd26f91e4a4b8880b90a5e266dc4c45908fe1b9cf0ae90400be15b6508
+size 9251

runs/Sep22_10-00-02_ip-10-192-12-209/events.out.tfevents.1726999203.ip-10-192-12-209.43955.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:69a0c83f69fd3b6005422e897d5f71414e4070e74028c54757e56ec719831960
+size 12114

tokenizer.json CHANGED Viewed

@@ -1,6 +1,11 @@
 {
   "version": "1.0",
-  "truncation": null,
   "padding": null,
   "added_tokens": [
     {

 {
   "version": "1.0",
+  "truncation": {
+    "direction": "Right",
+    "max_length": 128,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
   "padding": null,
   "added_tokens": [
     {

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d0182e1228d08bf24a4de1d1df8964f45fa2a8a5c19b929a6b7c93ef082f1ebc
 size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:d98df7c46aa141f4b035a0b5676c4b4bae82cec1f793a6427a332f37b21643e5
 size 5368