Information Access in the Presence of OCR Errors

Meeting name

1st ACM Workshop on Hardcopy Document Processing

Document Type

Conference Proceeding

Meeting location

Washington, D.C.

Publication Date

11-12-2004

Abstract

Over the last 15 years, the Information Science Research Institute (ISRI) at the University of Nevada, Las Vegas (UNLV) has conducted information access research in the presence of OCR errors. Our research has focused on issues associated with the construction of large document databases. In this paper, we will highlight our findings and detail our current activities.

Keywords

Categorization; Document conversion; Errors; Information extraction; Information retrieval; Markup; Optical character recognition

Disciplines

Computer Engineering | Databases and Information Systems | Electrical and Computer Engineering | Theory and Algorithms

Permissions

Use Find in Your Library, contact the author, or interlibrary loan to garner a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.

UNLV article access

Share

COinS