armanc/scientific_papers
Note: The scientific papers dataset contains two sets of long, structured documents, obtained from the arXiv and PubMed OpenAccess repositories. Both configurations, "arxiv" and "pubmed", have three features:
- article: the body of the document, with paragraphs separated by "\n".
- abstract: the abstract of the document, with paragraphs separated by "\n".
- section_names: the titles of the sections, separated by "\n".
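Because each feature is stored as a single delimited string, a record usually has to be split before it can be processed paragraph by paragraph. The following is a minimal sketch, assuming the separator is the newline character and that a record is available as a plain dictionary keyed by the feature names. The helper `split_record` and the sample record are hypothetical and invented for illustration; they are not part of the dataset or of any library.

```python
from typing import Dict, List

# Feature names as listed in the dataset description.
FIELDS = ("article", "abstract", "section_names")


def split_record(record: Dict[str, str], sep: str = "\n") -> Dict[str, List[str]]:
    """Split each delimited field of one record into a list of non-empty strings.

    `sep` is assumed to be a newline; pass a different value if the
    data turns out to use another separator.
    """
    return {
        name: [part.strip() for part in record[name].split(sep) if part.strip()]
        for name in FIELDS
    }


# Hypothetical record, made up for this example (not taken from the dataset).
sample = {
    "article": "First paragraph of the body.\nSecond paragraph of the body.",
    "abstract": "A one-paragraph abstract.",
    "section_names": "Introduction\nMethods\nConclusion",
}

parsed = split_record(sample)
print(len(parsed["article"]), "body paragraphs")
print(parsed["section_names"])
```

Keeping the separator as a parameter makes it easy to check a few real records first and adjust if the delimiter differs from what is assumed here.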