Optimal Parameter Selection Technique for a Neural Network Based Local Thresholding Method
The Journal of Pattern Recognition Research (JPRR) provides an international forum for the electronic publication of high-quality research and industrial experience articles in all areas of pattern recognition, machine learning, and artificial intelligence. JPRR is committed to rigorous yet rapid reviewing. Final versions are published electronically
(ISSN 1558-884X) immediately upon acceptance.
Optimal Parameter Selection Technique for a Neural Network Based Local Thresholding Method
Mohammed Jahirul Islam, Majid Ahmadi, Maher A Sid-Ahmed, Yasser M Alginahi
JPRR Vol 5, No 1 (2010); doi:10.13176/11.146 
Download
Mohammed Jahirul Islam, Majid Ahmadi, Maher A Sid-Ahmed, Yasser M Alginahi
Abstract
Thresholding of a given image into binary image is a necessary step for most image analysis and recognition techniques. In document recognition application, success of OCR mostly depends on the quality of the thresholded image. Non-uniform illumination, low contrast and complex background make it challenging in this application. In this paper, selection of optimal parameters for Neural Network (NN) based local thresholding approach for grey scale composite document image with non-uniform background is proposed. NN-based local image thresholding technique uses 8 statistical and textural image features to obtain a feature vector for each pixel from a window of size (2n+1)x(2n+1), where n>= 1. An exhaustive search was conducted on these features and found pixel value, mean and entropy are the optimal features at window size 3x3. To validate these 3 features some non-uniform watermarked document images with known binary document images called base documents are used. Characters were extracted from these watermarked documents using the proposed 3 features. The difference between the thresholded document and base document is the noise. A quantitative measure Peak-Signal-to-Noise ratio (PSNR) is used to measure the noise. In case of unknown base document characters were extracted through the proposed 3 features and used in a commercial OCR to obtain the character recognition rate. The average recognition rate 99.25% and PSNR shows that the proposed 3 features are the optimal compare to the NN-based thresholding technique with different parameters presented in the literature.
JPRR Vol 5, No 1 (2010); doi:10.13176/11.146 | Full Text  | Share this paper: