Fuzzy Information Extraction on OCR Text
Meeting name
2011 Eighth International Conference on Information Technology: New Generations
Document Type
Conference Proceeding
Meeting location
Las Vegas, NV
Publication Date
4-11-2011
Abstract
In this paper, we report on two experiments on identification and extraction of Date of Birth instances. The objective of these experiments is to increase the recall level by increasing the edit distance while obtaining a reasonable precision.
Keywords
Bills of materials; Data mining; Date of birth identification; Duality; Edit distance; Fuzzy information extraction; Fuzzy set theory; HTML; Information extraction; Information retrieval; OCR; OCR text; Optical character recognition; Optical character recognition software; Pattern matching; Patterns; Relations; Testing; Text editing; Training data
Disciplines
Computer Engineering | Electrical and Computer Engineering | Software Engineering
Permissions
Use Find in Your Library, contact the author, or interlibrary loan to garner a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.
Repository Citation
Pereda, R.,
Taghva, K.
(2011, April).
Fuzzy Information Extraction on OCR Text.
Presentation at 2011 Eighth International Conference on Information Technology: New Generations,
Las Vegas, NV.
Available at: https://digitalscholarship.unlv.edu/ece_presentations/18