Fuzzy Information Extraction on OCR Text

Meeting name

2011 Eighth International Conference on Information Technology: New Generations

Document Type

Conference Proceeding

Meeting location

Las Vegas, NV

Publication Date



In this paper, we report on two experiments on the identification and extraction of Date of Birth instances in OCR text. The objective of these experiments is to increase recall by increasing the permitted edit distance while maintaining reasonable precision.
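The trade-off described in the abstract can be illustrated with a small sketch. This is not the authors' method, which this record does not describe. It is a minimal example that assumes a plain Levenshtein edit distance, a fixed-length sliding window, and case-insensitive comparison. The function names (`edit_distance`, `best_window_distance`, `contains_fuzzy`) and the sample strings are hypothetical.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a two-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def best_window_distance(text: str, pattern: str) -> int:
    """Smallest edit distance between the pattern and any window of the text
    that has the same length as the pattern (a simplifying assumption)."""
    text, pattern = text.lower(), pattern.lower()
    n = len(pattern)
    if len(text) <= n:
        return edit_distance(text, pattern)
    return min(edit_distance(text[i:i + n], pattern)
               for i in range(len(text) - n + 1))


def contains_fuzzy(text: str, pattern: str, max_dist: int) -> bool:
    """True if some window of the text is within max_dist edits of the pattern."""
    return best_window_distance(text, pattern) <= max_dist


if __name__ == "__main__":
    pattern = "Date of Birth"
    ocr_line = "Dale of Birlh: 03/12/1975"   # hypothetical OCR damage: t read as l
    unrelated = "Date of Billing: 2010"      # hypothetical non-matching label

    for max_dist in range(4):
        print(max_dist,
              contains_fuzzy(ocr_line, pattern, max_dist),
              contains_fuzzy(unrelated, pattern, max_dist))
```

In this sketch the damaged label is only found once the threshold reaches 2, which is the recall gain. At a threshold of 3 the unrelated "Date of Billing" label is also accepted, which is the precision cost that a larger edit distance brings.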


Bills of materials; Data mining; Date of birth identification; Duality; Edit distance; Fuzzy information extraction; Fuzzy set theory; HTML; Information extraction; Information retrieval; OCR; OCR text; Optical character recognition; Optical character recognition software; Pattern matching; Patterns; Relations; Testing; Text editing; Training data


Computer Engineering | Electrical and Computer Engineering | Software Engineering


Use Find in Your Library, contact the author, or request an interlibrary loan to obtain a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or if publisher policy changes, the item will be deposited.
