Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/16966
Title: Quality assurance for segmentation and tagging of Chinese novels in the Ming and Qing dynasties
Authors: Xiong, D
Lu, Q 
Lo, F
Shi, D 
Chiu, TS
Keywords: Named entities
Novels in the Ming and Qing dynasties
Quality assurance
Tagging
Word segmentation
Issue Date: 2012
Publisher: IEEE
Source: Proceedings - 2012 International Conference on Asian Language Processing, IALP 2012, 2012, 6473700, p. 77-80 How to cite?
Abstract: This paper presents a word segmentation and named entity tagging project which annotates Chinese novels in the Ming and Qing dynasties. Computer-aided tools are used to assist the annotation. The focus of this paper will be on the quality assurance measures to ensure precision and consistency. The specification for word segmentation and named entity tagging is formulated based on the standards for modern Chinese segmentation commonly used in Mainland China and in Taiwan as well as the analysis of differences between Chinese classics and modern Chinese. The specification is established through iterative refinements. This refinement process can offer valuable insights into the quality control of computer-aided processing performed on Chinese literature works in the Ming and Qing dynasties and can be applied to those in even earlier periods. The finalized corpus, built in a computer-aided, manually-reviewed method in accordance with the specification, can be used for researches in literature, linguistics, information technology, and teaching of Chinese.
Description: 2012 International Conference on Asian Language Processing, IALP 2012, Hanoi, 13-15 November 2012
URI: http://hdl.handle.net/10397/16966
DOI: 10.1109/IALP.2012.60
Appears in Collections:Conference Paper

Access
View full-text via PolyU eLinks SFX Query
Show full item record

SCOPUSTM   
Citations

1
Last Week
0
Last month
0
Citations as of Sep 10, 2017

Page view(s)

61
Last Week
1
Last month
Checked on Sep 17, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.