Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/40012
Title: Chinese word segmentation based on word boundary decision
Other Titles: 基于词边界分类的中文分词方法
Authors: Li, S
Huang, CR 
Keywords: Computer application
Chinese information processing
Chinese word segmentation
WBD approach
Online learning
Issue Date: 2010
Publisher: 中国中文信息学会 ; 北京信息工程学院
Source: 中文信息学报 (Journal of Chinese information processing), 2010, v. 24, no. 1, p. 3-7 How to cite?
Journal: 中文信息学报 (Journal of Chinese information processing) 
Abstract: 该文研究和探讨一种新的分词方法:基 于词边界分类的方法。该方法直接对字符与字符之间的边界进行分类,判断其是否为两个词之间的边界,从而达到分词的目的。相对于目前主流的基于字标注的分词 方法,该方法的实现和训练更加快速、简单和直接,但却能获得比较接近的分词效果。更显著的是我们可以很容易地从词边界分类方法获得在线分词学习方法,该方 法能够使我们的分词系统非常迅速地学习新的标注样本。 
This paper focuses on the word boundary decision(WBD) approach to Chinese word segmentation.This new approach classifies a boundary between two characters into either a word boundary or not.Compared to the stat-of-the-arts methods based on character tagging,this approach is easier to implement and faster to execute,as well as a competitive performance.Particularly,the robust online learning module can be added to adapt a WBD system to new data quickly,enabling a reliable online Chinese segmentation system without domain or training data constraints. 
URI: http://hdl.handle.net/10397/40012
ISSN: 1003-0077
Rights: © 2010 China Academic Journal Electronic Publishing House. It is to be used strictly for educational and research use.
© 2010 中国学术期刊电子杂志出版社。本内容的使用仅限于教育、科研之目的。
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
r46898.pdf183.2 kBAdobe PDFView/Open
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

50
Last Week
1
Last month
Checked on Sep 18, 2017

Download(s)

61
Checked on Sep 18, 2017

Google ScholarTM

Check



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.