Improving OCR accuracy using adaptive image restoration
A technique that can improve the optical character recognition (OCR) accuracy of text images is presented. Using the output from an OCR system and a distorted text image, this technique trains an adaptive restoration filter and then applies the filter to the distorted text that the OCR system could not recognize. The restored text image is then reprocessed by the OCR system, and the restored characters are recognized with higher accuracy than the distorted text. A series of experiments was performed to determine a feasible adaptive restoration filter architecture, to establish the technique's ability to restore uniformly distorted text, and to demonstrate its ability to improve the OCR accuracy of real-world text documents. The results of these experiments show that the technique can improve both the pixel accuracy and the OCR accuracy of distorted text images.
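The train-then-restore loop described above can be illustrated with a minimal sketch. The abstract does not state which filter architecture the article settles on, so this example substitutes a plain linear filter fitted by least squares, which is an assumption and not the authors' method. It also assumes that an "ideal" image of the recognized characters is available (for example, rendered from the OCR output), and it simulates the distortion with a box blur plus noise. All function and variable names are hypothetical.

```python
import numpy as np


def extract_patches(image, k):
    """Return every k x k neighbourhood of `image` as one row (edge-padded)."""
    r = k // 2
    padded = np.pad(image, r, mode="edge")
    windows = np.lib.stride_tricks.sliding_window_view(padded, (k, k))
    return windows.reshape(image.shape[0] * image.shape[1], k * k)


def train_filter(distorted, ideal, mask, k=5):
    """Fit a k x k linear filter plus bias on the pixels selected by `mask`.

    `mask` marks the regions the OCR system recognized, where the ideal
    pixel values are assumed to be known.
    """
    selected = mask.ravel()
    X = extract_patches(distorted, k)[selected]
    X = np.hstack([X, np.ones((X.shape[0], 1))])  # bias column
    y = ideal.ravel()[selected]
    w, *_ = np.linalg.lstsq(X, y, rcond=None)
    return w


def apply_filter(distorted, w, k=5):
    """Apply the trained filter to every pixel of the distorted image."""
    X = extract_patches(distorted, k)
    X = np.hstack([X, np.ones((X.shape[0], 1))])
    return (X @ w).reshape(distorted.shape)


# Synthetic stand-in for a text image: blocky binary pattern, 64 x 64.
rng = np.random.default_rng(0)
ideal = np.kron(rng.integers(0, 2, (16, 16)), np.ones((4, 4)))

# Assumed uniform distortion: 3 x 3 box blur plus mild Gaussian noise.
blurred = extract_patches(ideal, 3).mean(axis=1).reshape(ideal.shape)
distorted = blurred + rng.normal(0.0, 0.05, ideal.shape)

# Pretend the OCR system recognized only the left half of the page.
mask = np.zeros(ideal.shape, dtype=bool)
mask[:, : ideal.shape[1] // 2] = True

w = train_filter(distorted, ideal, mask)
restored = apply_filter(distorted, w)

# Pixel error on the half the filter never saw during training.
unseen = ~mask
mse_before = np.mean((distorted[unseen] - ideal[unseen]) ** 2)
mse_after = np.mean((restored[unseen] - ideal[unseen]) ** 2)
print(f"MSE before: {mse_before:.4f}, after: {mse_after:.4f}")
```

In a real pipeline the restored image would be passed back to the OCR system; this sketch only measures pixel error, which is one of the two accuracy measures the abstract mentions.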
Electric filters; Optical character recognition; Pattern recognition systems
Use Find in Your Library, contact the author, or use interlibrary loan to obtain a copy of the article. Publisher copyright policy allows the author to archive the post-print (the author's final manuscript). When the post-print is available or the publisher's policy changes, the article will be deposited.
Improving OCR accuracy using adaptive image restoration.
SPIE Journal of Electronic Imaging, 5(3),