End of training
- README.md +12 -9
- model.safetensors +1 -1
README.md
CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-3](https://huggingface.co/sshleifer/distilbart-xsum-12-3) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.
-- Rouge1: 26.
-- Rouge2:
-- Rougel: 17.
-- Rougelsum: 21.
-- Gen Len:
+- Loss: 2.5886
+- Rouge1: 26.2164
+- Rouge2: 8.042
+- Rougel: 17.5545
+- Rougelsum: 21.4745
+- Gen Len: 62.0
 
 ## Model description
 
@@ -47,15 +47,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs:
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
-| 3.
-|
+| 3.295 | 1.0 | 1041 | 3.0302 | 26.1938 | 8.0525 | 17.4251 | 21.3971 | 61.7637 |
+| 2.9061 | 2.0 | 2082 | 2.7844 | 26.3284 | 7.8489 | 17.3299 | 21.487 | 61.951 |
+| 2.7181 | 3.0 | 3123 | 2.6605 | 25.3295 | 7.5429 | 16.8791 | 21.0243 | 62.0 |
+| 2.5903 | 4.0 | 4164 | 2.6097 | 25.5526 | 7.6456 | 17.1916 | 21.0674 | 61.9885 |
+| 2.5327 | 5.0 | 5205 | 2.5886 | 26.2164 | 8.042 | 17.5545 | 21.4745 | 62.0 |
 
 
 ### Framework versions
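The training results table in the README diff can be cross-checked with a few lines of Python. The sketch below is only an illustration: the row values are copied from the table, while the tuple layout and the names `COLUMNS`, `ROWS`, `steps_per_epoch`, and `best_epoch` are assumptions introduced here, not part of the training code.

```python
# Rows copied from the "Training results" table in the README diff.
# The column names below are illustrative labels, not identifiers from the trainer.
COLUMNS = (
    "training_loss", "epoch", "step", "validation_loss",
    "rouge1", "rouge2", "rougel", "rougelsum", "gen_len",
)
ROWS = [
    dict(zip(COLUMNS, values))
    for values in [
        (3.295, 1.0, 1041, 3.0302, 26.1938, 8.0525, 17.4251, 21.3971, 61.7637),
        (2.9061, 2.0, 2082, 2.7844, 26.3284, 7.8489, 17.3299, 21.487, 61.951),
        (2.7181, 3.0, 3123, 2.6605, 25.3295, 7.5429, 16.8791, 21.0243, 62.0),
        (2.5903, 4.0, 4164, 2.6097, 25.5526, 7.6456, 17.1916, 21.0674, 61.9885),
        (2.5327, 5.0, 5205, 2.5886, 26.2164, 8.042, 17.5545, 21.4745, 62.0),
    ]
]


def steps_per_epoch(rows):
    """Return the number of optimizer steps per epoch implied by the table."""
    per_epoch = {round(row["step"] / row["epoch"]) for row in rows}
    if len(per_epoch) != 1:
        raise ValueError(f"inconsistent steps per epoch: {sorted(per_epoch)}")
    return per_epoch.pop()


def best_epoch(rows, metric, higher_is_better=True):
    """Return the epoch whose row has the best value for the given metric."""
    pick = max if higher_is_better else min
    return pick(rows, key=lambda row: row[metric])["epoch"]


if __name__ == "__main__":
    print("steps per epoch:", steps_per_epoch(ROWS))
    print("lowest validation loss at epoch:",
          best_epoch(ROWS, "validation_loss", higher_is_better=False))
    print("highest Rouge1 at epoch:", best_epoch(ROWS, "rouge1"))
```

On these rows, the lowest validation loss falls at epoch 5.0, which matches the summary values in the first hunk, while the highest Rouge1 falls at epoch 2.0.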
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:784a0096b5e8c9c6d298c15fb16506a92c20b7fb562dc4f0ab2c2f9ea671c6de
 size 1020714768
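The `model.safetensors` entry in this commit is a Git LFS pointer: it records the SHA-256 digest and byte size of the real weights file. A minimal sketch of checking a locally downloaded file against such a pointer follows, using only the standard library. The helper names `parse_lfs_pointer` and `matches_pointer` and the local path in the usage comment are hypothetical; the pointer text is the new version shown in the diff.

```python
import hashlib
from pathlib import Path

# New pointer contents as shown in the diff.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:784a0096b5e8c9c6d298c15fb16506a92c20b7fb562dc4f0ab2c2f9ea671c6de
size 1020714768
"""


def parse_lfs_pointer(text):
    """Split a pointer file into its version, hash algorithm, digest, and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False  # cheap check first; avoids hashing a ~1 GB file needlessly
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Usage (hypothetical local path):
# ok = matches_pointer("model.safetensors", parse_lfs_pointer(POINTER))
```

Comparing the size before hashing is a design choice: a size mismatch is detected immediately, and the file is read in chunks so the whole checkpoint never has to fit in memory.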