Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/34095
Title: Distant BI-Gram model, collocation, and their applications in post-processing for Chinese character recognition
Authors: Xu, R
Lu, Q 
Yeung, D
Wang, XZ
Keywords: Character recognition
Natural languages
Probability
Statistical analysis
Issue Date: 2002
Publisher: IEEE
Source: 2002 International Conference on Machine Learning and Cybernetics, 2002 : proceedings : 4-5 November 2002, v. 4, p. 2251-2255 How to cite?
Abstract: In this paper, we present a distant BI-Gram model, which extended the regular BI-Gram model by considering the distance information and weight parameters, in order to describe the long-distance restrictions among the Chinese sentence. The extraction of the statistical information and weight parameters of this language model is discussed. Based on this work, the word combination strength and spread are employed to extract the recurrent word combinations, i.e. collocations. The distant BI-Gram model and collocation are applied to a statistic-based post-processing system for improving the recognition performance of Chinese characters. The experimental results show that by employing these two language models, the post-processing system achieves a higher improvement performance.
URI: http://hdl.handle.net/10397/34095
ISBN: 0-7803-7508-4
DOI: 10.1109/ICMLC.2002.1175440
Appears in Collections:Conference Paper

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

35
Last Week
1
Last month
Checked on Oct 15, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.