Improving language modeling for (off-line) Chinese character recognition

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/85601

DC Field	Value	Language
dc.contributor	Department of Computing	-
dc.creator	Hung, Kei-yuen	-
dc.identifier.uri	https://theses.lib.polyu.edu.hk/handle/200/2353	-
dc.language.iso	English	-
dc.title	Improving language modeling for (off-line) Chinese character recognition	-
dc.type	Thesis	-
dcterms.abstract	We analyze the error characteristics of a Chinese character recognizer and develop two approaches to improve a Chinese character recognition system. First, we develop a non-contiguous, context-dependent language model as a post-processing module. The model uses distant context to predict the character of interest. In terms of accuracy, it performs only as well as the traditional bigram model. Second, we develop a method to detect errors in the language model. The method employs pattern recognition techniques, combining dictionary and statistical features to predict whether a block of characters is correct or contains an error. Our experiments show that this detection scheme is effective, achieving 80% precision, 91% recall, and a 75% skip ratio.	-
dcterms.accessRights	open access	-
dcterms.educationLevel	M.Phil.	-
dcterms.extent	x, 79 leaves : ill. ; 30 cm	-
dcterms.issued	2002	-
dcterms.LCSH	Hong Kong Polytechnic University -- Dissertations	-
dcterms.LCSH	Chinese language -- Data processing	-
dcterms.LCSH	Chinese character sets (Data processing)	-
Appears in Collections:	Thesis
