Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/101456
Title: Generating a structured summary of numerous academic papers : dataset and method
Authors: Liu, S
Cao, J 
Yang, R 
Wen, Z 
Issue Date: 2022
Source: Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Messe Wien, Vienna, Austria, 23-29 July 2022, p. 4259-4265
Abstract: Writing a survey paper on one research topic usually needs to cover the salient content from numerous related papers, which can be modeled as a multi-document summarization (MDS) task. Existing MDS datasets usually focus on producing the structureless summary covering a few input documents. Meanwhile, previous structured summary generation works focus on summarizing a single document into a multi-section summary. These existing datasets and methods cannot meet the requirements of summarizing numerous academic papers into a structured summary. To deal with the scarcity of available data, we propose BigSurvey, the first large-scale dataset for generating comprehensive summaries of numerous academic papers on each topic. We collect target summaries from more than seven thousand survey papers and utilize their 430 thousand reference papers’ abstracts as input documents. To organize the diverse content from dozens of input documents and ensure the efficiency of processing long text sequences, we propose a summarization method named category-based alignment and sparse transformer (CAST). The experimental results show that our CAST method outperforms various advanced summarization methods.
Publisher: International Joint Conferences on Artificial Intelligence
ISBN: 978-1-956792-00-3 (Online ISBN)
DOI: 10.24963/ijcai.2022/591
Description: The 31st International Joint Conference on Artificial Intelligence. July 23-29, 2022. Messe Wien, Vienna, Austria
Rights: Copyright © 2022 International Joint Conferences on Artificial Intelligence
Posted with the permission of the publisher.
The following publication Liu, S., Cao, J., Yang, R., & Wen, Z. (2022). Generating a structured summary of numerous academic papers: Dataset and method. Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence. Main Track. Pages 4259-4265 is available at https://doi.org/10.24963/ijcai.2022/591.
Appears in Collections: Conference Paper

Files in This Item:
File: Generating_Structured_Summary.pdf
Size: 327.42 kB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record

Page views: 103 (as of Oct 6, 2025)
Downloads: 47 (as of Oct 6, 2025)

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.