Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/101456
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computingen_US
dc.creatorLiu Sen_US
dc.creatorCao, Jen_US
dc.creatorYang, Ren_US
dc.creatorWen, Zen_US
dc.date.accessioned2023-09-18T02:26:39Z-
dc.date.available2023-09-18T02:26:39Z-
dc.identifier.isbn978-1-956792-00-3 (Online ISBN)en_US
dc.identifier.urihttp://hdl.handle.net/10397/101456-
dc.descriptionThe 31st International Joint Conference on Artificial Intelligence. July 23-29,2022. Messe Wien, Vienna, Austriaen_US
dc.language.isoenen_US
dc.publisherInternational Joint Conferences on Artificial Intelligenceen_US
dc.rightsCopyright © 2022 International Joint Conferences on Artificial Intelligenceen_US
dc.rightsPosted with the permission of the publisher.|en_US
dc.rightsThe following publication Liu, S., Cao, J., Yang, R., & Wen, Z. (2023). Generating a structured summary of numerous academic papers: Dataset and method. Proceedings of the Thirty-First International Joint Conference on Artificial Intelligence. Main Track. Pages 4259-4265 is available at https://doi.org/10.24963/ijcai.2022/591.en_US
dc.titleGenerating a structured summary of numerous academic papers : dataset and methoden_US
dc.typeConference Paperen_US
dc.identifier.spage4259en_US
dc.identifier.epage4265en_US
dc.identifier.doi10.24963/ijcai.2022/591en_US
dcterms.abstractWriting a survey paper on one research topic usually needs to cover the salient content from numerous related papers, which can be modeled as a multi-document summarization (MDS) task. Existing MDS datasets usually focus on producing the structureless summary covering a few input documents. Meanwhile, previous structured summary generation works focus on summarizing a single document into a multi-section summary. These existing datasets and methods cannot meet the requirements of summarizing numerous academic papers into a structured summary. To deal with the scarcity of available data, we propose BigSurvey, the first large-scale dataset for generating comprehensive summaries of numerous academic papers on each topic. We collect target summaries from more than seven thousand survey papers and utilize their 430 thousand reference papers’ abstracts as input documents. To organize the diverse content from dozens of input documents and ensure the efficiency of processing long text sequences, we propose a summarization method named category-based alignment and sparse transformer (CAST). The experimental results show that our CAST method outperforms various advanced summarization methods.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationProceedings of the Thirty-First International Joint Conference on Artificial Intelligence, Messe Wien, Vienna, Austria, 23-29 July 2022, p. 4259-4265en_US
dcterms.issued2022-
dc.identifier.ros2022002825-
dc.relation.ispartofbookProceedings of the Thirty-First International Joint Conference on Artificial Intelligenceen_US
dc.relation.conferenceInternational Joint Conference on Artificial Intelligence [IJCAI]en_US
dc.description.validate202309 bcwwen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberCDCF_2022-2023, a2276-
dc.identifier.SubFormID47300-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextHong Kong Jockey Club Charities Trust (Project S/N Ref.: 2021-0369)en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryPublisher permissionen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Generating_Structured_Summary.pdf327.42 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

103
Citations as of Oct 6, 2025

Downloads

47
Citations as of Oct 6, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.