Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/22491
Title: Document image binarization with feedback for improving character segmentation
Authors: Chi, Z 
Wang, Q
Keywords: Document image processing
Thresholding
Region-based binarization
Page segmentation
Character segmentation
Multi-layer perceptron
BP algorithm
Issue Date: 2005
Publisher: World Scientific
Source: International journal of image and graphics, 2005, v. 5, no. 2, p. 281-309 How to cite?
Journal: International journal of image and graphics 
Abstract: Binarization of gray scale document images is one of the most important steps in automatic document image processing. In this paper, we present a two-stage document image binarization approach, which includes a top-down region-based binarization at the first stage and a neural network based binarization technique for the problematic blocks at the second stage after a feedback checking. Our two-stage approach is particularly effective for binarizing text images of highlighted or marked text. The region-based binarization method is fast and suitable for processing large document images. However, the block effect and regional edge noise are two unavoidable problems resulting in poor character segmentation and recognition. The neural network based classifier can achieve good performance in two-class classification problem such as the binarization of gray level document images. However, it is computationally costly. In our two-stage binarization approach, the feedback criteria are employed to keep the well binarized blocks from the first stage binarization and to re-binarize the problematic blocks at the second stage using the neural network binarizer to improve the character segmentation quality. Experimental results on a number of document images show that our two-stage binarization approach performs better than the single-stage binarization techniques tested in terms of character segmentation quality and computational cost.
URI: http://hdl.handle.net/10397/22491
ISSN: 0219-4678
EISSN: 1793-6756
DOI: 10.1142/S0219467805001768
Appears in Collections:Journal/Magazine Article

Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page view(s)

38
Last Week
1
Last month
Checked on Aug 14, 2017

Google ScholarTM

Check

Altmetric



Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.