Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/40012
Title: Chinese word segmentation based on word boundary decision
Other Title: 基于词边界分类的中文分词方法
Authors: Li, S
Huang, CR 
Issue Date: 2010
Source: 中文信息学报 (Journal of Chinese information processing), 2010, v. 24, no. 1, p. 3-7
Abstract: 该文研究和探讨一种新的分词方法:基于词边界分类的方法。该方法直接对字符与字符之间的边界进行分类,判断其是否为两个词之间的边界,从而达到分词的目的。相对于目前主流的基于字标注的分词方法,该方法的实现和训练更加快速、简单和直接,但却能获得比较接近的分词效果。更显著的是我们可以很容易地从词边界分类方法获得在线分词学习方法,该方法能够使我们的分词系统非常迅速地学习新的标注样本。
This paper focuses on the word boundary decision (WBD) approach to Chinese word segmentation. This new approach classifies each boundary between two adjacent characters as either a word boundary or not. Compared with state-of-the-art methods based on character tagging, this approach is easier to implement and faster to execute, while achieving competitive performance. In particular, a robust online learning module can be added to adapt a WBD system to new data quickly, enabling a reliable online Chinese segmentation system without domain or training data constraints.
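The boundary-classification idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give the paper's features or classifier, so the feature set (the left character, the right character, and the pair) and the plain perceptron update are assumptions, and all names (`boundary_features`, `labels_from_words`, `BoundaryPerceptron`) are hypothetical. The sketch only shows how a binary decision at each character gap yields a segmentation, and how a per-sample update allows new annotated sentences to be learned one at a time.

```python
def boundary_features(text, i):
    """Features for the gap between text[i] and text[i + 1] (assumed feature set)."""
    left, right = text[i], text[i + 1]
    return [f"L={left}", f"R={right}", f"LR={left}{right}"]


def labels_from_words(words):
    """Gold labels per character gap: True where one word ends and the next begins."""
    labels = []
    for idx, word in enumerate(words):
        labels.extend([False] * (len(word) - 1))  # gaps inside a word
        if idx < len(words) - 1:
            labels.append(True)                   # gap between two words
    return labels


class BoundaryPerceptron:
    """Binary perceptron over gap features (a stand-in classifier, not the paper's)."""

    def __init__(self):
        self.weights = {}

    def score(self, feats):
        return sum(self.weights.get(f, 0.0) for f in feats)

    def predict(self, feats):
        return self.score(feats) > 0

    def update(self, feats, is_boundary):
        """Online step: change weights only when the current prediction is wrong."""
        if self.predict(feats) != is_boundary:
            delta = 1.0 if is_boundary else -1.0
            for f in feats:
                self.weights[f] = self.weights.get(f, 0.0) + delta

    def learn(self, words):
        """Learn from one segmented sentence, given as a list of words."""
        text = "".join(words)
        for i, gold in enumerate(labels_from_words(words)):
            self.update(boundary_features(text, i), gold)

    def segment(self, text):
        """Split text at every gap the classifier labels as a word boundary."""
        if not text:
            return []
        words, start = [], 0
        for i in range(len(text) - 1):
            if self.predict(boundary_features(text, i)):
                words.append(text[start:i + 1])
                start = i + 1
        words.append(text[start:])
        return words


model = BoundaryPerceptron()
model.learn(["我们", "喜欢", "分词"])   # one annotated sentence, learned online
print(model.segment("我们喜欢分词"))
```

Because each call to `learn` updates the weights from a single sentence, newly annotated samples can be fed to the model as they arrive, without retraining from scratch.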
Keywords: Computer application
Chinese information processing
Chinese word segmentation
WBD approach
Online learning
Publisher: 中国中文信息学会 ; 北京信息工程学院
Journal: 中文信息学报 (Journal of Chinese information processing) 
ISSN: 1003-0077
Rights: © 2010 China Academic Journal Electronic Publishing House. It is to be used strictly for educational and research use.
© 2010 中国学术期刊电子杂志出版社。本内容的使用仅限于教育、科研之目的。
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: r46898.pdf
Size: 183.2 kB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.