Title

Fuzzy Information Extraction on OCR Text

Meeting name

2011 Eighth International Conference on Information Technology: New Generations

Meeting location

Las Vegas, NV

Document Type

Conference Proceeding

Publication Date

4-11-2011

Description

In this paper, we report on two experiments on identification and extraction of Date of Birth instances. The objective of these experiments is to increase the recall level by increasing the edit distance while obtaining a reasonable precision.

Keywords

Bills of materials; Data mining; Date of birth identification; Duality; Edit distance; Fuzzy information extraction; Fuzzy set theory; HTML; Information extraction; Information retrieval; OCR; OCR text; Optical character recognition; Optical character recognition software; Pattern matching; Patterns; Relations; Testing; Text editing; Training data

Disciplines

Computer Engineering | Electrical and Computer Engineering | Software Engineering

Permissions

Use Find in Your Library, contact the author, or interlibrary loan to garner a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.

Identifier

DOI: 10.1109/ITNG.2011.99