Update README.md
Browse files
README.md
CHANGED
@@ -83,21 +83,22 @@ print("Generated Summary:", summary)
|
|
83 |
|
84 |
# Training hyperparameters
|
85 |
The following hyperparameters were used during training:
|
86 |
-
|
87 |
-
|
88 |
-
|
89 |
-
|
90 |
-
|
91 |
-
|
92 |
-
|
93 |
-
|
94 |
-
|
95 |
-
|
96 |
-
|
97 |
-
|
98 |
-
|
99 |
-
|
100 |
-
|
|
|
101 |
|
102 |
|
103 |
|
|
|
83 |
|
84 |
# Training hyperparameters
|
85 |
The following hyperparameters were used during training:
|
86 |
+
- accumulation_steps: 2
|
87 |
+
- num_epochs: 20
|
88 |
+
- num_beams: 6
|
89 |
+
- learning_rate: lr=5e-5
|
90 |
+
- optimizer: AdamW with betas=(0.9,0.999) and epsilon=1e-08
|
91 |
+
- activation_function: gelu
|
92 |
+
- add_bias_logits: True
|
93 |
+
- normalize_embedding: True
|
94 |
+
- add_final_layer_norm: False
|
95 |
+
- normalize_before: False
|
96 |
+
|
97 |
+
# Score
|
98 |
+
Evaluate the model with the test dataset of <a href="https://github.com/nakhunchumpolsathien/ThaiSum">ThaiSum</a>, consisting of a total of 11,000 articles, with the following scores:
|
99 |
+
- Rouge1: 0.490279
|
100 |
+
- Rouge2: 0.289839
|
101 |
+
- Rougel: 0.489334
|
102 |
|
103 |
|
104 |
|