datht
/

distilbart-xsum-12-3-finetuned-xsum

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

datht commited on Apr 3

Commit

5a5c266

•

1 Parent(s): ef79020

End of training

Files changed (2) hide show

README.md +12 -9
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-3](https://huggingface.co/sshleifer/distilbart-xsum-12-3) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.9351
-- Rouge1: 26.3092
-- Rouge2: 7.7006
-- Rougel: 17.1028
-- Rougelsum: 21.4485
-- Gen Len: 61.9097
 ## Model description
@@ -47,15 +47,18 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
-| 3.3587        | 1.0   | 1041 | 3.0784          | 26.3422 | 7.8159 | 17.1532 | 21.3208   | 61.8223 |
-| 3.0404        | 2.0   | 2082 | 2.9351          | 26.3092 | 7.7006 | 17.1028 | 21.4485   | 61.9097 |
 ### Framework versions

 This model is a fine-tuned version of [sshleifer/distilbart-xsum-12-3](https://huggingface.co/sshleifer/distilbart-xsum-12-3) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.5886
+- Rouge1: 26.2164
+- Rouge2: 8.042
+- Rougel: 17.5545
+- Rougelsum: 21.4745
+- Gen Len: 62.0
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
+| 3.295         | 1.0   | 1041 | 3.0302          | 26.1938 | 8.0525 | 17.4251 | 21.3971   | 61.7637 |
+| 2.9061        | 2.0   | 2082 | 2.7844          | 26.3284 | 7.8489 | 17.3299 | 21.487    | 61.951  |
+| 2.7181        | 3.0   | 3123 | 2.6605          | 25.3295 | 7.5429 | 16.8791 | 21.0243   | 62.0    |
+| 2.5903        | 4.0   | 4164 | 2.6097          | 25.5526 | 7.6456 | 17.1916 | 21.0674   | 61.9885 |
+| 2.5327        | 5.0   | 5205 | 2.5886          | 26.2164 | 8.042  | 17.5545 | 21.4745   | 62.0    |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3c1a8c1f796c0d405909fc55589a7b42b20a8f741d68c27ec89949a019016c62
 size 1020714768

 version https://git-lfs.github.com/spec/v1
+oid sha256:784a0096b5e8c9c6d298c15fb16506a92c20b7fb562dc4f0ab2c2f9ea671c6de
 size 1020714768