Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/71409
Title: Difficulties in the application of “Chinese character component standard of GB 13000.1 character set for information processing” for Chinese character input
Other Title: 《信息处理用GB13000.1字符集汉字部件规范》在输入法应用中的难点讨论
Authors: Zhang, X 
Issue Date: 2004
Source: 中文信息学报 (Journal of Chinese information processing), 2004, v. 18, no. 4, p. 60-65
Abstract: 《信息处理用GB13000.1字符集汉字部件规范》对于规范汉字形码输入法具有非常重要的意义。然而，在实际运用上却存在着部件数量太大，部件定义难以操作，部件拆分组合不易掌握等难处。造成困难的原因主要有：（1）基础部件主要靠列表来确定，（2）部件强调按理切分和成字组合，（3）过多依赖“组字能力”的判别，（4）过分注重部件数量的限制。要走出“难”的困境，应该在现有规范的基础上根据汉字的形态特征制定出简便可靠的部件识别规则和切分规则。实验证明，这种方法是行之有效的。
The Chinese Character Component Standard of GB 13000.1 Character Set for Information Processing is an important document for the standardization of shape-based Chinese character input methods. Yet when applied to the design and implementation of a nontrivial Chinese character input system, the standard runs into a number of difficulties: the number of coding components is large and hard to remember, the definition of basic components is difficult to apply in practice, and the rules for disassembling and assembling components are hard to master. The sources of these difficulties include (a) the definition of basic components mainly by enumeration, (b) an emphasis on disassembling and assembling components according to etymology and character formation, (c) excessive reliance on judging the character-forming capability of candidate components, and (d) overemphasis on restricting the number of basic components. To escape these difficulties, convenient and reliable rules for component identification and segmentation should be built on the existing component standard by taking full advantage of the form features of Chinese characters. The feasibility and effectiveness of the proposed approach have been verified by the successful development of the ZYQ Chinese character input system.
Keywords: Computer application
Chinese information processing
Chinese character input
Chinese character component
Standard
Publisher: 中国中文信息学会 ; 北京信息工程学院
Journal: 中文信息学报 (Journal of Chinese information processing) 
ISSN: 1003-0077
Rights: © 2004 中国学术期刊电子杂志出版社。本内容的使用仅限于教育、科研之目的。
© 2004 China Academic Journal Electronic Publishing House. It is to be used strictly for educational and research purposes.
Appears in Collections: Journal/Magazine Article

Files in This Item:
File: r22315.pdf
Size: 187.03 kB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Version of Record