Master of Science (MS)
Number of Pages
This thesis will attempt to establish if synthesized images can be used to predict the performance of Optical Character Recognition (OCR) algorithms and devices. The value of this research lies in reducing the considerable costs associated with preparing test images for OCR research. The paper reports on a series of experiments in which synthesized images of text files in nine different fonts and sizes are input to eight commercial OCR devices. The method used to create the images is explained and a detailed analysis of the character and word confusion between the output and the true text files is presented. The synthesized images are then printed and scanned to mechanically introduce "noise". The resulting images are also input to the devices and analysis performed. A high correlation was found between the output from the printed and scanned images and the output from "real world" images.
Algorithms; Devices; Evaluate; Images; OCR; Performance; Synthesized
University of Nevada, Las Vegas
If you are the rightful copyright holder of this dissertation or thesis and wish to have the full text removed from Digital Scholarship@UNLV, please submit a request to firstname.lastname@example.org and include clear identification of the work, preferably with URL.
Jenkins, Frank Robert, "The use of synthesized images to evaluate the performance of Ocr devices and algorithms" (1993). UNLV Retrospective Theses & Dissertations. 304.