Introducing Yale Scisumm Dataset

Rui Zhang
LILY Lab
Published in
1 min readMar 13, 2019

A summary of scientific papers should ideally incorporate the impact of the papers on the research community reflected by citations. We have developed the first large-scale, human-annotated Scisumm dataset, ScisummNet. It provides over 1,000 papers in the ACL anthology network with their citation networks (e.g. citation sentences, citation counts) and their comprehensive, manual summaries.

Check out our paper published in AAAI 2019, ScisummNet: A Large Annotated Corpus and Content-Impact Models for Scientific Paper Summarization with Citation Networks, and the project site.

--

--