Fuzzy Information Extraction on OCR Text
2011 Eighth International Conference on Information Technology: New Generations
Las Vegas, NV
In this paper, we report on two experiments on the identification and extraction of Date of Birth instances. The objective of these experiments is to increase recall by allowing a larger edit distance while maintaining reasonable precision.
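The trade-off described in the abstract can be illustrated with a minimal sketch. This record does not describe the authors' actual method, so everything below is an assumption for illustration only: the function names `edit_distance` and `is_fuzzy_match` are hypothetical, the distance is plain Levenshtein distance, and the sample strings are invented to mimic OCR errors. The sketch shows how raising the permitted edit distance admits more damaged variants of a label (higher recall) but also starts admitting unrelated phrases (lower precision).

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with the standard two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def is_fuzzy_match(candidate: str, pattern: str, max_distance: int) -> bool:
    """True if candidate is within max_distance edits of pattern (case-insensitive)."""
    return edit_distance(candidate.lower(), pattern.lower()) <= max_distance


# Invented sample phrases (assumption): one clean label, one OCR-damaged
# label, and two phrases that are not Date of Birth labels at all.
samples = ["Date of Birth", "Dale of Birlh", "Date of Bath", "Date of Death"]

for k in range(4):
    matched = [s for s in samples if is_fuzzy_match(s, "Date of Birth", k)]
    print(k, matched)
```

In this toy data, a threshold of 2 recovers the OCR-damaged "Dale of Birlh" but also accepts "Date of Bath", and a threshold of 3 additionally accepts "Date of Death". That is the kind of recall gain and precision loss the experiments aim to balance.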
Bills of materials; Data mining; Date of birth identification; Duality; Edit distance; Fuzzy information extraction; Fuzzy set theory; HTML; Information extraction; Information retrieval; OCR; OCR text; Optical character recognition; Optical character recognition software; Pattern matching; Patterns; Relations; Testing; Text editing; Training data
Computer Engineering | Electrical and Computer Engineering | Software Engineering
Use Find in Your Library, contact the author, or use interlibrary loan to obtain a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.
Fuzzy Information Extraction on OCR Text.
Presentation at 2011 Eighth International Conference on Information Technology: New Generations,
Las Vegas, NV.
Available at: http://digitalscholarship.unlv.edu/ece_presentations/18