Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/89095
DC Field | Value | Language
dc.contributor | Department of Computing | -
dc.creator | Chen, Y | -
dc.creator | Ma, Y | -
dc.creator | Mao, X | -
dc.creator | Li, Q | -
dc.date.accessioned | 2021-02-04T02:39:18Z | -
dc.date.available | 2021-02-04T02:39:18Z | -
dc.identifier.issn | 2364-1185 | -
dc.identifier.uri | http://hdl.handle.net/10397/89095 | -
dc.language.iso | en | en_US
dc.publisher | SpringerOpen | en_US
dc.rights | © The Author(s) 2019 | en_US
dc.rights | Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. | en_US
dc.rights | The following publication Chen, Y., Ma, Y., Mao, X., & Li, Q. (2019). Multi-task learning for abstractive and extractive summarization. Data Science and Engineering, 4(1), 14-23 is available at https://dx.doi.org/10.1007/s41019-019-0087-7 | en_US
dc.subject | Attention mechanism | en_US
dc.subject | Automatic document summarization | en_US
dc.subject | Multi-task learning | en_US
dc.title | Multi-task learning for abstractive and extractive summarization | en_US
dc.type | Journal/Magazine Article | en_US
dc.identifier.spage | 14 | -
dc.identifier.epage | 23 | -
dc.identifier.volume | 4 | -
dc.identifier.issue | 1 | -
dc.identifier.doi | 10.1007/s41019-019-0087-7 | -
dcterms.abstract | The abstractive and extractive methods are the two main approaches to automatic document summarization. In this paper, to fully exploit the relatedness and advantages of both approaches, we propose a general unified framework for abstractive summarization that incorporates extractive summarization as an auxiliary task. In particular, our framework is composed of a shared hierarchical document encoder, a decoder based on a hierarchical attention mechanism, and an extractor. We adopt a multi-task learning method to train the two tasks jointly, which enables the shared encoder to better capture the semantics of the document. Moreover, as our main task is abstractive summarization, we constrain the attention learned in the abstractive task with the labels of the extractive task to strengthen the consistency between the two tasks. Experiments on the CNN/DailyMail dataset demonstrate that both the auxiliary task and the attention constraint contribute significantly to improved performance, and that our model is comparable to state-of-the-art abstractive models. In addition, we halve the number of labels for the extractive task, pretrain the extractor, and jointly train the two tasks, using the estimated sentence salience from the extractive task to constrain the attention of the abstractive task. The results degrade only slightly compared with using fully labeled data for the auxiliary task. | -
dcterms.accessRights | open access | en_US
dcterms.bibliographicCitation | Data science and engineering, Mar. 2019, v. 4, no. 1, p. 14-23 | -
dcterms.isPartOf | Data science and engineering | -
dcterms.issued | 2019-03 | -
dc.identifier.scopus | 2-s2.0-85065029539 | -
dc.identifier.eissn | 2364-1541 | -
dc.description.validate | 202101 bcrc | -
dc.description.oa | Version of Record | en_US
dc.identifier.FolderNumber | OA_Scopus/WOS | en_US
dc.description.pubStatus | Published | en_US
dc.description.oaCategory | CC | en_US
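
The abstract above outlines a concrete architecture: a shared hierarchical encoder feeding both an extractor (auxiliary task) and an attention-based abstractive decoder (main task), trained jointly with the decoder's sentence-level attention constrained by the extractive labels. Below is a minimal PyTorch sketch of such a setup. The module names, dimensions, decoder-state initialization, loss weights, and the particular form of the attention-consistency term are illustrative assumptions, not the authors' published implementation.

import torch
import torch.nn as nn
import torch.nn.functional as F


class SharedHierarchicalEncoder(nn.Module):
    """Word-level BiGRU per sentence, then a sentence-level BiGRU over the document."""

    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.word_rnn = nn.GRU(emb_dim, hid_dim, batch_first=True, bidirectional=True)
        self.sent_rnn = nn.GRU(2 * hid_dim, hid_dim, batch_first=True, bidirectional=True)

    def forward(self, docs):                          # docs: (batch, n_sents, n_words)
        b, s, w = docs.shape
        words = self.embed(docs.view(b * s, w))       # (b*s, w, emb)
        _, h = self.word_rnn(words)                   # h: (2, b*s, hid)
        sent_in = torch.cat([h[0], h[1]], dim=-1).view(b, s, -1)
        sent_repr, _ = self.sent_rnn(sent_in)         # (b, s, 2*hid), shared by both tasks
        return sent_repr


class Extractor(nn.Module):
    """Auxiliary task: per-sentence salience scores from the shared representations."""

    def __init__(self, hid_dim=256):
        super().__init__()
        self.scorer = nn.Linear(2 * hid_dim, 1)

    def forward(self, sent_repr):                     # (b, s, 2*hid) -> (b, s) logits
        return self.scorer(sent_repr).squeeze(-1)


class AbstractiveDecoder(nn.Module):
    """Main task: word generation with sentence-level attention over the encoder."""

    def __init__(self, vocab_size, emb_dim=128, hid_dim=256):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, emb_dim, padding_idx=0)
        self.cell = nn.GRUCell(emb_dim, 2 * hid_dim)
        self.attn = nn.Linear(2 * hid_dim, 2 * hid_dim, bias=False)
        self.out = nn.Linear(4 * hid_dim, vocab_size)

    def forward(self, sent_repr, dec_inputs):         # dec_inputs: (b, t) teacher-forced tokens
        state = sent_repr.mean(dim=1)                 # crude decoder-state initialization
        logits, attns = [], []
        for t in range(dec_inputs.size(1)):
            state = self.cell(self.embed(dec_inputs[:, t]), state)
            scores = torch.bmm(self.attn(sent_repr), state.unsqueeze(-1)).squeeze(-1)
            alpha = F.softmax(scores, dim=-1)         # sentence-level attention (b, s)
            ctx = torch.bmm(alpha.unsqueeze(1), sent_repr).squeeze(1)
            logits.append(self.out(torch.cat([state, ctx], dim=-1)))
            attns.append(alpha)
        return torch.stack(logits, dim=1), torch.stack(attns, dim=1)


def joint_loss(logits, targets, ext_logits, ext_labels, attns, lam_ext=1.0, lam_attn=1.0):
    """Abstractive NLL + extractive BCE + an attention-consistency term (weights assumed)."""
    abs_loss = F.cross_entropy(logits.reshape(-1, logits.size(-1)),
                               targets.reshape(-1), ignore_index=0)
    ext_loss = F.binary_cross_entropy_with_logits(ext_logits, ext_labels)
    # Pull the time-averaged sentence attention towards the normalized extractive labels.
    label_dist = ext_labels / ext_labels.sum(dim=-1, keepdim=True).clamp(min=1.0)
    attn_loss = -(label_dist * attns.mean(dim=1).clamp(min=1e-8).log()).sum(dim=-1).mean()
    return abs_loss + lam_ext * ext_loss + lam_attn * attn_loss

During teacher forcing, dec_inputs would be the gold summary shifted right and targets the unshifted gold summary. In the reduced-label setting described in the abstract, ext_labels for unlabeled documents could be replaced by salience scores predicted by a pretrained extractor.
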
Appears in Collections: Journal/Magazine Article
Files in This Item:
File | Description | Size | Format
Chen2019_Article_Multi-TaskLearningForAbstracti.pdf | | 1.63 MB | Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record