metadata
language: en
license: apache-2.0
tags:
- summarization
datasets: arxiv-summarization
Introduction
A led-base-16384 model fine-tuned to summarize arXiv papers. Inputs are paper abstracts and full documents; outputs are summaries of the papers.
AllenAI's Longformer Encoder-Decoder (LED)
As described in "Longformer: The Long-Document Transformer" by Iz Beltagy, Matthew E. Peters, and Arman Cohan, led-base-16384 was initialized from bart-base, since both models share the exact same architecture. To process 16K tokens, bart-base's position embedding matrix was simply copied 16 times.
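The copy step can be illustrated with a minimal sketch. It assumes the source matrix has 1024 learned positions (so 1024 * 16 = 16384), uses plain Python lists instead of real model tensors, and the helper name `extend_position_embeddings` is hypothetical; it is not the conversion script used to build led-base-16384.

```python
def extend_position_embeddings(pos_emb, copies):
    """Return a new matrix made of `copies` stacked copies of `pos_emb`.

    pos_emb: list of rows, one row (list of floats) per position.
    This is an illustrative helper, not part of any library.
    """
    return [list(row) for _ in range(copies) for row in pos_emb]


# Toy matrix: 4 positions, embedding size 2 (real models are much larger).
toy = [[0.0, 0.1], [1.0, 1.1], [2.0, 2.1], [3.0, 3.1]]
extended = extend_position_embeddings(toy, copies=16)

print(len(extended))          # 64 rows = 4 positions * 16 copies
print(extended[4] == toy[0])  # True: position 4 reuses the row of position 0
```

With this layout, position `i` in the extended matrix starts from the same vector as position `i % n` in the original, where `n` is the original number of positions.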
ROUGE-2
Type | Score |
---|---|
precision | 0.1839148953011932 |
recall | 0.14904707945189774 |
fmeasure | 0.1580026685776864 |
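For readers unfamiliar with the metric, here is a simplified sketch of how ROUGE-2 precision, recall, and F-measure are computed from bigram overlap for a single reference/candidate pair. It assumes lowercase whitespace tokenization and no stemming; the function names are illustrative, and this is not necessarily the scorer that produced the numbers above.

```python
from collections import Counter


def bigrams(tokens):
    """Count adjacent token pairs in a token list."""
    return Counter(zip(tokens, tokens[1:]))


def rouge2(reference, candidate):
    """Return (precision, recall, fmeasure) based on bigram overlap."""
    ref = bigrams(reference.lower().split())
    cand = bigrams(candidate.lower().split())
    # Clipped overlap: each bigram counts at most as often as it
    # appears in both texts.
    overlap = sum((ref & cand).values())
    precision = overlap / max(sum(cand.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    if precision + recall == 0:
        return 0.0, 0.0, 0.0
    fmeasure = 2 * precision * recall / (precision + recall)
    return precision, recall, fmeasure


p, r, f = rouge2("the cat sat on the mat", "the cat sat on a mat")
print(p, r, f)
```

In this toy pair, three of the five candidate bigrams also occur in the reference, so precision and recall both come out at about 0.6. Corpus-level scores are typically aggregated over per-document scores, so a reported F-measure need not equal the harmonic mean of the reported precision and recall.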