Information Access in the Presence of OCR Errors
Meeting name
1st ACM Workshop on Hardcopy Document Processing
Document Type
Conference Proceeding
Meeting location
Washington, D.C.
Publication Date
11-12-2004
Abstract
Over the last 15 years, the Information Science Research Institute (ISRI) at the University of Nevada, Las Vegas (UNLV) has conducted information access research in the presence of OCR errors. Our research has focused on issues associated with the construction of large document databases. In this paper, we will highlight our findings and detail our current activities.
Keywords
Categorization; Document conversion; Errors; Information extraction; Information retrieval; Markup; Optical character recognition
Disciplines
Computer Engineering | Databases and Information Systems | Electrical and Computer Engineering | Theory and Algorithms
Permissions
Use Find in Your Library, contact the author, or use interlibrary loan to obtain a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.
Repository Citation
Taghva, K., Nartker, T., Borsack, J. (2004, November). Information Access in the Presence of OCR Errors. Presentation at 1st ACM Workshop on Hardcopy Document Processing, Washington, D.C.
Available at: https://digitalscholarship.unlv.edu/ece_presentations/30