Award Date


Degree Type


Degree Name

Master of Science (MS)


Computer Science

Number of Pages



This thesis will attempt to establish if synthesized images can be used to predict the performance of Optical Character Recognition (OCR) algorithms and devices. The value of this research lies in reducing the considerable costs associated with preparing test images for OCR research. The paper reports on a series of experiments in which synthesized images of text files in nine different fonts and sizes are input to eight commercial OCR devices. The method used to create the images is explained and a detailed analysis of the character and word confusion between the output and the true text files is presented. The synthesized images are then printed and scanned to mechanically introduce "noise". The resulting images are also input to the devices and analysis performed. A high correlation was found between the output from the printed and scanned images and the output from "real world" images.


Algorithms; Devices; Evaluate; Images; OCR; Performance; Synthesized

Controlled Subject

Computer science

File Format


File Size

1658.88 KB

Degree Grantor

University of Nevada, Las Vegas




If you are the rightful copyright holder of this dissertation or thesis and wish to have the full text removed from Digital Scholarship@UNLV, please submit a request to and include clear identification of the work, preferably with URL.


IN COPYRIGHT. For more information about this rights statement, please visit