yihongwang
/

my_billsum_model

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

yihongwang commited on Sep 7, 2024

Commit

e868778

•

1 Parent(s): 09cc1e4

End of training

Files changed (1) hide show

README.md +16 -10

README.md CHANGED Viewed

@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.5593
-- Rouge1: 0.1425
-- Rouge2: 0.053
-- Rougel: 0.1186
-- Rougelsum: 0.1188
 - Gen Len: 19.0
 ## Model description
@@ -48,17 +48,23 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
-| No log        | 1.0   | 62   | 2.8332          | 0.1229 | 0.0373 | 0.0996 | 0.0997    | 19.0    |
-| No log        | 2.0   | 124  | 2.6310          | 0.1338 | 0.0456 | 0.1099 | 0.1099    | 19.0    |
-| No log        | 3.0   | 186  | 2.5732          | 0.139  | 0.0496 | 0.1156 | 0.1157    | 19.0    |
-| No log        | 4.0   | 248  | 2.5593          | 0.1425 | 0.053  | 0.1186 | 0.1188    | 19.0    |
 ### Framework versions

 This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.3699
+- Rouge1: 0.1958
+- Rouge2: 0.0949
+- Rougel: 0.167
+- Rougelsum: 0.167
 - Gen Len: 19.0
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+| No log        | 1.0   | 62   | 2.7958          | 0.1228 | 0.0386 | 0.0997 | 0.1       | 19.0    |
+| No log        | 2.0   | 124  | 2.5846          | 0.1385 | 0.047  | 0.1139 | 0.114     | 19.0    |
+| No log        | 3.0   | 186  | 2.5034          | 0.1506 | 0.0563 | 0.1232 | 0.1234    | 19.0    |
+| No log        | 4.0   | 248  | 2.4548          | 0.1734 | 0.0756 | 0.1467 | 0.1468    | 19.0    |
+| No log        | 5.0   | 310  | 2.4231          | 0.1893 | 0.0877 | 0.1597 | 0.1597    | 19.0    |
+| No log        | 6.0   | 372  | 2.3991          | 0.1926 | 0.0913 | 0.1638 | 0.1638    | 19.0    |
+| No log        | 7.0   | 434  | 2.3862          | 0.1945 | 0.0944 | 0.166  | 0.166     | 19.0    |
+| No log        | 8.0   | 496  | 2.3764          | 0.195  | 0.094  | 0.1662 | 0.1663    | 19.0    |
+| 2.7718        | 9.0   | 558  | 2.3714          | 0.1959 | 0.0952 | 0.1672 | 0.1672    | 19.0    |
+| 2.7718        | 10.0  | 620  | 2.3699          | 0.1958 | 0.0949 | 0.167  | 0.167     | 19.0    |
 ### Framework versions