Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/3783
Title: Investigations on temporal-oriented event-based extractive summarization
Authors: Wu, Mingli
Keywords: Hong Kong Polytechnic University -- Dissertations
Automatic abstracting
Computational linguistics
Issue Date: 2008
Publisher: The Hong Kong Polytechnic University
Abstract: Automatic summarization aims to produce a concise summary of source documents by identifying the focused topics in documents. Normally, topics are represented by some essential events. Topics may evolve or shift over time. Tracking the trend of the topics requires anchoring events on the time line. Unfortunately, both events and their associated time features are not well studied in previous work. Investigating event-based and temporal-oriented summarization techniques are primary objectives of this study. As a matter of fact, the salience of contents could hardly be evaluated from single point of view. Exploiting a framework which can effectively integrate multiple impact factors is another objective. We define events by "action" words as well as associated named entities. Events weave documents into a map built either on event instances or event concepts. Relevance between events is exploited to identify important events. To utilize temporal information associated to events, it is necessary to extract and normalize temporal expressions. We investigate rule-based approaches for these tasks. Two statistical measures are employed to evaluate the significance of events based on their temporal distributions. Sentence selection is a complicated process. Therefore we explore various features including surface, content, event and relevance features under a learning-based classification framework. Event-based and temporal-oriented approaches are incorporated as features into this framework. The contributions of this study are listed as follows. (1) Event-based summarization approaches are proposed. They achieve competitive results when compared with successful word-based approaches. (2) Temporal concepts are introduced into event-based summarization and temporal information is found crucial to summarization on documents which contain evolving topics. (3) An adaptive leaning-based framework is developed to incorporate various types of features. (4) A system for temporal expression extraction and normalization is implemented. It is an effective tool not only practical for document summarization, but also for many other applications.
Description: xiii, 151 leaves : ill. ; 30 cm.
PolyU Library Call No.: [THS] LG51 .H577P COMP 2008 Wu
URI: http://hdl.handle.net/10397/3783
Rights: All rights reserved.
Appears in Collections:Thesis

Files in This Item:
File Description SizeFormat 
b21900346_link.htmFor PolyU Users 162 BHTMLView/Open
b21900346_ir.pdfFor All Users (Non-printable) 1.86 MBAdobe PDFView/Open
Show full item record

Page view(s)

431
Last Week
2
Last month
Checked on Jul 16, 2017

Download(s)

537
Checked on Jul 16, 2017

Google ScholarTM

Check



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.