Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/10978
Title: A unicode based adaptive segmentor
Authors: Lu, Q; Chan, ST; Li, BL; Yu, SW
Issue Date: 2004
Source: Journal of Chinese language and computing, 2004, v. 14, no. 3, p. 221-234
Journal: Journal of Chinese language and computing 
Abstract: This paper presents a Unicode-based Chinese word segmentor that handles Chinese text in Simplified, Traditional, or mixed mode. The system takes a divide-and-conquer approach: a preprocessing stage recognizes personal names, numbers, time expressions, numerical values, and similar items. The segmentor then uses tagging information for disambiguation. The design is modular: each functional part is implemented as a separate module that tackles one problem at a time, which provides flexibility and extensibility. Results show that adding the preprocessing and auxiliary modules increases the segmentor's accuracy, and that the system is easily adapted to different applications.
URI: http://hdl.handle.net/10397/10978
Appears in Collections: Journal/Magazine Article
