AdamCodd commited on
Commit
8d01593
1 Parent(s): 1cfab48

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +113 -0
README.md ADDED
@@ -0,0 +1,113 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - generated_from_trainer
5
+ - seq2seq
6
+ - summarization
7
+ datasets:
8
+ - samsum
9
+ metrics:
10
+ - rouge
11
+ widget:
12
+ - text: >
13
+ Emily: Hey Alex, have you heard about the new restaurant that opened
14
+ downtown?
15
+
16
+ Alex: No, I haven't. What's it called?
17
+
18
+ Emily: It's called "Savory Bites." They say it has the best pasta in town.
19
+
20
+ Alex: That sounds delicious. When are you thinking of checking it out?
21
+
22
+ Emily: How about this Saturday? We can make it a dinner date.
23
+
24
+ Alex: Sounds like a plan, Emily. I'm looking forward to it.
25
+ model-index:
26
+ - name: bart-large-xsum-samsum
27
+ results:
28
+ - task:
29
+ type: summarization
30
+ name: Summarization
31
+ dataset:
32
+ name: >-
33
+ SAMSum Corpus: A Human-annotated Dialogue Dataset for Abstractive
34
+ Summarization
35
+ type: samsum
36
+ metrics:
37
+ - type: rouge-1
38
+ value: 54.3073
39
+ name: Validation ROUGE-1
40
+ - type: rouge-2
41
+ value: 29.0947
42
+ name: Validation ROUGE-2
43
+ - type: rouge-l
44
+ value: 44.4676
45
+ name: Validation ROUGE-L
46
+ ---
47
+
48
+ # bart-large-cnn-samsum
49
+
50
+ This model is a fine-tuned version of [facebook/bart-large-xsum](https://huggingface.co/facebook/bart-large-xsum) on the [samsum dataset](https://huggingface.co/datasets/samsum).
51
+ It achieves the following results on the evaluation set:
52
+ - Loss: 0.759
53
+ - Rouge1: 54.3073
54
+ - Rouge2: 29.0947
55
+ - Rougel: 44.4676
56
+ - Rougelsum: 49.895
57
+
58
+ ## Model description
59
+
60
+ More information needed
61
+
62
+ ## Intended uses & limitations
63
+
64
+ ```python
65
+ from transformers import pipeline
66
+ summarizer = pipeline("summarization", model="AdamCodd/bart-large-cnn-samsum")
67
+
68
+ conversation = '''Emily: Hey Alex, have you heard about the new restaurant that opened downtown?
69
+ Alex: No, I haven't. What's it called?
70
+ Emily: It's called "Savory Bites." They say it has the best pasta in town.
71
+ Alex: That sounds delicious. When are you thinking of checking it out?
72
+ Emily: How about this Saturday? We can make it a dinner date.
73
+ Alex: Sounds like a plan, Emily. I'm looking forward to it.
74
+ '''
75
+ result = summarizer(conversation)
76
+ print(result)
77
+ ```
78
+
79
+ ## Training and evaluation data
80
+
81
+ More information needed
82
+
83
+ ## Training procedure
84
+
85
+ ### Training hyperparameters
86
+
87
+ The following hyperparameters were used during training:
88
+ - learning_rate: 3e-05
89
+ - train_batch_size: 4
90
+ - eval_batch_size: 4
91
+ - seed: 1270
92
+ - optimizer: AdamW with betas=(0.9,0.999) and epsilon=1e-08
93
+ - lr_scheduler_type: linear
94
+ - lr_scheduler_warmup_steps: 150
95
+ - num_epochs: 1
96
+
97
+ ### Training results
98
+
99
+ | key | value |
100
+ | --- | ----- |
101
+ | eval_rouge1 | 54.3073 |
102
+ | eval_rouge2 | 29.0947 |
103
+ | eval_rougeL | 44.4676 |
104
+ | eval_rougeLsum | 49.895 |
105
+
106
+ ### Framework versions
107
+
108
+ - Transformers 4.35.0
109
+ - Accelerate 0.24.1
110
+ - Datasets 2.14.6
111
+ - Tokenizers 0.14.3
112
+
113
+ If you want to support me, you can [here](https://ko-fi.com/adamcodd).