Award Date

1-1-1993

Degree Type

Thesis

Degree Name

Master of Science (MS)

Department

Computer Science

Number of Pages

54

Abstract

This thesis will attempt to establish if synthesized images can be used to predict the performance of Optical Character Recognition (OCR) algorithms and devices. The value of this research lies in reducing the considerable costs associated with preparing test images for OCR research. The paper reports on a series of experiments in which synthesized images of text files in nine different fonts and sizes are input to eight commercial OCR devices. The method used to create the images is explained and a detailed analysis of the character and word confusion between the output and the true text files is presented. The synthesized images are then printed and scanned to mechanically introduce "noise". The resulting images are also input to the devices and analysis performed. A high correlation was found between the output from the printed and scanned images and the output from "real world" images.

Keywords

Algorithms; Devices; Evaluate; Images; OCR; Performance; Synthesized

Controlled Subject

Computer science

File Format

pdf

File Size

1658.88 KB

Degree Grantor

University of Nevada, Las Vegas

Language

English

Permissions

If you are the rightful copyright holder of this dissertation or thesis and wish to have the full text removed from Digital Scholarship@UNLV, please submit a request to digitalscholarship@unlv.edu and include clear identification of the work, preferably with URL.

Identifier

https://doi.org/10.25669/6m80-ciqn


Share

COinS