End of training
Browse files- README.md +15 -14
- generation_config.json +1 -0
README.md
CHANGED
@@ -18,10 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
18 |
This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
- Loss: 1.5103
|
21 |
-
- Rouge1:
|
22 |
-
- Rouge2:
|
23 |
-
- Rougel:
|
24 |
-
- Rougelsum:
|
|
|
25 |
|
26 |
## Model description
|
27 |
|
@@ -50,16 +51,16 @@ The following hyperparameters were used during training:
|
|
50 |
|
51 |
### Training results
|
52 |
|
53 |
-
| Training Loss | Epoch | Step | Validation Loss | Rouge1
|
54 |
-
|
55 |
-
| 1.9748 | 1.0 | 561 | 1.6071 |
|
56 |
-
| 1.7823 | 2.0 | 1122 | 1.5643 |
|
57 |
-
| 1.692 | 3.0 | 1683 | 1.5422 |
|
58 |
-
| 1.6226 | 4.0 | 2244 | 1.5243 |
|
59 |
-
| 1.5668 | 5.0 | 2805 | 1.5146 |
|
60 |
-
| 1.5281 | 6.0 | 3366 | 1.5083 |
|
61 |
-
| 1.5058 | 7.0 | 3927 | 1.5100 |
|
62 |
-
| 1.4834 | 8.0 | 4488 | 1.5103 |
|
63 |
|
64 |
|
65 |
### Framework versions
|
|
|
18 |
This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
- Loss: 1.5103
|
21 |
+
- Rouge1: 47.1409
|
22 |
+
- Rouge2: 24.7704
|
23 |
+
- Rougel: 43.2012
|
24 |
+
- Rougelsum: 43.2249
|
25 |
+
- Gen Len: 13.4509
|
26 |
|
27 |
## Model description
|
28 |
|
|
|
51 |
|
52 |
### Training results
|
53 |
|
54 |
+
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
|
55 |
+
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
|
56 |
+
| 1.9748 | 1.0 | 561 | 1.6071 | 45.4173 | 23.1498 | 41.2241 | 41.2184 | 14.0501 |
|
57 |
+
| 1.7823 | 2.0 | 1122 | 1.5643 | 45.6835 | 23.6188 | 42.0517 | 42.0965 | 13.3988 |
|
58 |
+
| 1.692 | 3.0 | 1683 | 1.5422 | 45.9189 | 23.4450 | 42.2140 | 42.2774 | 13.3287 |
|
59 |
+
| 1.6226 | 4.0 | 2244 | 1.5243 | 46.7272 | 24.5003 | 43.0697 | 43.1200 | 13.1984 |
|
60 |
+
| 1.5668 | 5.0 | 2805 | 1.5146 | 46.4580 | 24.0323 | 42.6975 | 42.7558 | 13.5050 |
|
61 |
+
| 1.5281 | 6.0 | 3366 | 1.5083 | 46.7795 | 24.2760 | 43.1536 | 43.1569 | 13.5010 |
|
62 |
+
| 1.5058 | 7.0 | 3927 | 1.5100 | 46.8698 | 24.6214 | 43.0844 | 43.0983 | 13.4509 |
|
63 |
+
| 1.4834 | 8.0 | 4488 | 1.5103 | 47.1409 | 24.7704 | 43.2012 | 43.2249 | 13.4509 |
|
64 |
|
65 |
|
66 |
### Framework versions
|
generation_config.json
CHANGED
@@ -1,6 +1,7 @@
|
|
1 |
{
|
2 |
"decoder_start_token_id": 0,
|
3 |
"eos_token_id": 1,
|
|
|
4 |
"pad_token_id": 0,
|
5 |
"transformers_version": "4.42.4"
|
6 |
}
|
|
|
1 |
{
|
2 |
"decoder_start_token_id": 0,
|
3 |
"eos_token_id": 1,
|
4 |
+
"max_new_tokens": 64,
|
5 |
"pad_token_id": 0,
|
6 |
"transformers_version": "4.42.4"
|
7 |
}
|