SiddhanthRaja commited on
Commit
40c82b6
·
1 Parent(s): 9113ee6

End of training

Browse files
README.md CHANGED
@@ -3,6 +3,8 @@ license: mit
3
  base_model: facebook/bart-large-cnn
4
  tags:
5
  - generated_from_trainer
 
 
6
  model-index:
7
  - name: bart-large-cnn-spotify-podcasts
8
  results: []
@@ -14,6 +16,13 @@ should probably proofread and complete it, then remove this comment. -->
14
  # bart-large-cnn-spotify-podcasts
15
 
16
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
 
 
 
 
 
 
 
17
 
18
  ## Model description
19
 
@@ -33,19 +42,22 @@ More information needed
33
 
34
  The following hyperparameters were used during training:
35
  - learning_rate: 2e-05
36
- - train_batch_size: 8
37
- - eval_batch_size: 8
38
  - seed: 42
39
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
40
  - lr_scheduler_type: linear
41
- - num_epochs: 1
42
  - mixed_precision_training: Native AMP
43
 
44
  ### Training results
45
 
46
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
47
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
48
- | No log | 1.0 | 117 | 1.1526 | 0.4777 | 0.3071 | 0.3726 | 0.3741 | 97.4893 |
 
 
 
49
 
50
 
51
  ### Framework versions
 
3
  base_model: facebook/bart-large-cnn
4
  tags:
5
  - generated_from_trainer
6
+ metrics:
7
+ - rouge
8
  model-index:
9
  - name: bart-large-cnn-spotify-podcasts
10
  results: []
 
16
  # bart-large-cnn-spotify-podcasts
17
 
18
  This model is a fine-tuned version of [facebook/bart-large-cnn](https://huggingface.co/facebook/bart-large-cnn) on an unknown dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 1.3282
21
+ - Rouge1: 0.5073
22
+ - Rouge2: 0.3354
23
+ - Rougel: 0.4104
24
+ - Rougelsum: 0.4098
25
+ - Gen Len: 82.5021
26
 
27
  ## Model description
28
 
 
42
 
43
  The following hyperparameters were used during training:
44
  - learning_rate: 2e-05
45
+ - train_batch_size: 4
46
+ - eval_batch_size: 4
47
  - seed: 42
48
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
49
  - lr_scheduler_type: linear
50
+ - num_epochs: 4
51
  - mixed_precision_training: Native AMP
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
56
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
57
+ | No log | 1.0 | 233 | 1.2600 | 0.4886 | 0.3178 | 0.3942 | 0.3939 | 90.6652 |
58
+ | No log | 2.0 | 466 | 1.2273 | 0.4993 | 0.326 | 0.4012 | 0.4015 | 92.4077 |
59
+ | 1.0674 | 3.0 | 699 | 1.2512 | 0.5104 | 0.3406 | 0.4149 | 0.4148 | 84.0215 |
60
+ | 1.0674 | 4.0 | 932 | 1.3282 | 0.5073 | 0.3354 | 0.4104 | 0.4098 | 82.5021 |
61
 
62
 
63
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bfca8dacd50ad4be9b92a5a2da6805029e47fc624ca054805c2be38451b7beb4
3
  size 1625422896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6f657767e44f366ed1e76c89f47040f2949d4127dbfc1ad4fa05952d230e33c9
3
  size 1625422896
runs/Nov18_15-12-34_b5e0e1ae0024/events.out.tfevents.1700320362.b5e0e1ae0024.48.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:4805d3a671275dc18cd6c96929dedc69dbcfdbd7765f72aa05d42ed98674fc76
3
- size 6607
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8d2f14bb70775bd7ddc3e7843354e46827965d1109d1aa2ec616b25bb0483848
3
+ size 8011