Commit 88b75ca by yimiwang
1 parent: afbf901

End of training

Files changed (1)
  1. README.md +12 -11
README.md CHANGED
@@ -18,12 +18,12 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.7712
- - Rouge1: 43.5898
- - Rouge2: 17.9519
- - Rougel: 31.6266
- - Rougelsum: 40.4599
- - Gen Len: 100.2965
 
  ## Model description
 
@@ -43,19 +43,20 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
- - train_batch_size: 5
- - eval_batch_size: 5
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 2
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
- | 1.911 | 1.0 | 3374 | 1.7750 | 43.3477 | 17.7556 | 31.5789 | 40.2601 | 96.8776 |
- | 1.901 | 2.0 | 6748 | 1.7712 | 43.5898 | 17.9519 | 31.6266 | 40.4599 | 100.2965 |
 
 
  ### Framework versions
 
 
  This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on an unknown dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.7588
+ - Rouge1: 43.5001
+ - Rouge2: 17.8611
+ - Rougel: 31.5148
+ - Rougelsum: 40.2359
+ - Gen Len: 98.7718
 
  ## Model description
 
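To make the effect of this hunk easier to read, the following sketch computes the change in each final evaluation metric between the removed (`-`) and added (`+`) lines. The dictionary and function names are hypothetical and used only for illustration; the numbers are copied from the diff itself.

```python
# Final evaluation metrics copied from the removed (-) and added (+) lines.
old = {"Loss": 1.7712, "Rouge1": 43.5898, "Rouge2": 17.9519,
       "Rougel": 31.6266, "Rougelsum": 40.4599, "Gen Len": 100.2965}
new = {"Loss": 1.7588, "Rouge1": 43.5001, "Rouge2": 17.8611,
       "Rougel": 31.5148, "Rougelsum": 40.2359, "Gen Len": 98.7718}


def metric_deltas(before, after):
    """Return {metric: after - before}, rounded to 4 decimal places."""
    return {name: round(after[name] - before[name], 4) for name in before}


deltas = metric_deltas(old, new)
for name, delta in deltas.items():
    print(f"{name:<10} {old[name]:>9.4f} -> {new[name]:>9.4f}  ({delta:+.4f})")
```

Every delta is negative: the evaluation loss improves slightly while the ROUGE scores and generation length drop a little. The two runs differ in batch size and epoch count, so this comparison alone does not show which change caused which difference.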
 
 
  The following hyperparameters were used during training:
  - learning_rate: 5e-05
+ - train_batch_size: 6
+ - eval_batch_size: 6
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 3
 
  ### Training results
 
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:--------:|
+ | 1.9097 | 1.0 | 2812 | 1.7698 | 43.3373 | 17.6766 | 31.4769 | 40.0561 | 97.4492 |
+ | 1.8854 | 3.0 | 8436 | 1.7588 | 43.5001 | 17.8611 | 31.5148 | 40.2359 | 98.7718 |
 
 
  ### Framework versions
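
The step counts in the two training tables are consistent with the batch-size change from 5 to 6, and they narrow down the size of the otherwise unnamed training set. The sketch below is a rough check, not something the model card states. It assumes a single device, no gradient accumulation, a final partial batch that is kept rather than dropped, and the same training set in both runs. `size_bounds` is a hypothetical helper name.

```python
def size_bounds(steps_per_epoch, batch_size):
    """Range of dataset sizes N with ceil(N / batch_size) == steps_per_epoch."""
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return low, high


# Removed rows: 3374 steps per epoch at batch size 5.
old_low, old_high = size_bounds(3374, 5)
# Added rows: 2812 steps per epoch at batch size 6.
new_low, new_high = size_bounds(2812, 6)

# Sizes consistent with both runs (assumption: same training set in both).
common = (max(old_low, new_low), min(old_high, new_high))

print("old run:", old_low, old_high)
print("new run:", new_low, new_high)
print("both:   ", common)
```

Under those assumptions the two ranges overlap, which suggests a training set of roughly 16.9k examples in both runs. If gradient accumulation or multiple devices were used, the bounds would scale accordingly and this estimate would not hold.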