Add evaluation results on the default config of billsum
Browse filesBeep boop, I am a bot from Hugging Face's automatic model evaluator 👋!\
Your model has been evaluated on the default config of the [billsum](https://huggingface.co/datasets/billsum) dataset by
@pszemraj
, using the predictions stored [here](https://huggingface.co/datasets/autoevaluate/autoeval-staging-eval-billsum-default-3fec5f-14625985).\
Accept this pull request to see the results displayed on the [Hub leaderboard](https://huggingface.co/spaces/autoevaluate/leaderboards?dataset=billsum).\
Evaluate your model on more datasets [here](https://huggingface.co/spaces/autoevaluate/model-evaluator?dataset=billsum).
README.md
CHANGED
@@ -113,6 +113,39 @@ model-index:
|
|
113 |
type: gen_len
|
114 |
value: 82.729
|
115 |
verified: true
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
116 |
---
|
117 |
|
118 |
# long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP
|
|
|
113 |
type: gen_len
|
114 |
value: 82.729
|
115 |
verified: true
|
116 |
+
- task:
|
117 |
+
type: summarization
|
118 |
+
name: Summarization
|
119 |
+
dataset:
|
120 |
+
name: billsum
|
121 |
+
type: billsum
|
122 |
+
config: default
|
123 |
+
split: test
|
124 |
+
metrics:
|
125 |
+
- name: ROUGE-1
|
126 |
+
type: rouge
|
127 |
+
value: 36.5688
|
128 |
+
verified: true
|
129 |
+
- name: ROUGE-2
|
130 |
+
type: rouge
|
131 |
+
value: 12.5849
|
132 |
+
verified: true
|
133 |
+
- name: ROUGE-L
|
134 |
+
type: rouge
|
135 |
+
value: 22.2461
|
136 |
+
verified: true
|
137 |
+
- name: ROUGE-LSUM
|
138 |
+
type: rouge
|
139 |
+
value: 30.6507
|
140 |
+
verified: true
|
141 |
+
- name: loss
|
142 |
+
type: loss
|
143 |
+
value: 2.6456267833709717
|
144 |
+
verified: true
|
145 |
+
- name: gen_len
|
146 |
+
type: gen_len
|
147 |
+
value: 139.0398
|
148 |
+
verified: true
|
149 |
---
|
150 |
|
151 |
# long-t5-tglobal-large-pubmed-3k-booksum-16384-WIP
|