|Title:||Investigations on temporal-oriented event-based extractive summarization||Authors:||Wu, Mingli||Keywords:||Hong Kong Polytechnic University -- Dissertations
|Issue Date:||2008||Publisher:||The Hong Kong Polytechnic University||Abstract:||Automatic summarization aims to produce a concise summary of source documents by identifying the focused topics in documents. Normally, topics are represented by some essential events. Topics may evolve or shift over time. Tracking the trend of the topics requires anchoring events on the time line. Unfortunately, both events and their associated time features are not well studied in previous work. Investigating event-based and temporal-oriented summarization techniques is the primary objective of this study. As a matter of fact, the salience of content can hardly be evaluated from a single point of view. Exploiting a framework which can effectively integrate multiple impact factors is another objective. We define events by "action" words as well as associated named entities. Events weave documents into a map built either on event instances or event concepts. Relevance between events is exploited to identify important events. To utilize temporal information associated with events, it is necessary to extract and normalize temporal expressions. We investigate rule-based approaches for these tasks. Two statistical measures are employed to evaluate the significance of events based on their temporal distributions. Sentence selection is a complicated process. Therefore, we explore various features including surface, content, event and relevance features under a learning-based classification framework. Event-based and temporal-oriented approaches are incorporated as features into this framework. The contributions of this study are as follows. (1) Event-based summarization approaches are proposed. They achieve competitive results when compared with successful word-based approaches. (2) Temporal concepts are introduced into event-based summarization, and temporal information is found to be crucial to summarizing documents that contain evolving topics. (3) An adaptive learning-based framework is developed to incorporate various types of features. (4) A system for temporal expression extraction and normalization is implemented. It is an effective tool, practical not only for document summarization but also for many other applications.||Description:||xiii, 151 leaves : ill. ; 30 cm.
PolyU Library Call No.: [THS] LG51 .H577P COMP 2008 Wu
|URI:||http://hdl.handle.net/10397/3783||Rights:||All rights reserved.|
|Appears in Collections:||Thesis|
Files in This Item:
|b21900346_link.htm||For PolyU Users||162 B||HTML||View/Open|
|b21900346_ir.pdf||For All Users (Non-printable)||1.86 MB||Adobe PDF||View/Open|