Information Access in the Presence of OCR Errors

Meeting name

1st ACM Workshop on Hardcopy Document Processing

Document Type

Conference Proceeding

Meeting location

Washington, D.C.

Publication Date



Abstract

Over the last 15 years, the Information Science Research Institute (ISRI) at the University of Nevada, Las Vegas (UNLV) has conducted research on information access in the presence of OCR errors. Our research has focused on issues associated with the construction of large document databases. In this paper, we highlight our findings and describe our current activities.


Keywords

Categorization; Document conversion; Errors; Information extraction; Information retrieval; Markup; Optical character recognition


Disciplines

Computer Engineering | Databases and Information Systems | Electrical and Computer Engineering | Theory and Algorithms


Use Find in Your Library, contact the author, or request an interlibrary loan to obtain a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or if publisher policy changes, the item will be deposited.
