Electrical & Computer Engineering Faculty Research

Name Extraction and Formal Concept Analysis

Kazem Taghva, University of Nevada, Las VegasFollow
Russell Beckley, University of Nevada, Las Vegas
Jeffrey Coombs, University of Nevada, Las VegasFollow

Document Type

Chapter

Publication Date

2011

Publication Title

Conceptual Structures for Discovering Knowledge

Publisher

Springer Berlin Heidelberg

First page number:

339

Last page number:

345

Abstract

Many applications of Formal Concept Analysis (FCA) start with a set of structured data such as objects and their properties. In practice, most of the data which is readily available are in the form of unstructured or semistructured text. A typical application of FCA assumes the extraction of objects and their properties by some other methods or techniques. For example, in the 2003 Los Alamos National Lab (LANL) project on Advanced Knowledge Integration In Assessing Terrorist Threats, a data extraction tool was used to mine the text for the structured data. In this paper, we provide a detailed description of our approach to extraction of personal names for possible subsequent use inFCA. Our basic approach is to integrate statistics on names and other words into an adaptation of a Hidden Markov Model (HMM). We use lists of names and their relative frequencies compiled from U.S. Census data. We also use a list of non-name words along with their frequencies in a training set from our collection of documents. These lists are compiled into one master list to be used as a part of the design.

Disciplines

Electrical and Computer Engineering | Engineering

Language

English

Permissions

Use Find in Your Library, contact the author, or interlibrary loan to garner a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.

Repository Citation

Taghva, K., Beckley, R., Coombs, J. (2011). Name Extraction and Formal Concept Analysis. Conceptual Structures for Discovering Knowledge 339-345. Springer Berlin Heidelberg.

UNLV article access

COinS

Digital Scholarship@UNLV

Electrical & Computer Engineering Faculty Research

Name Extraction and Formal Concept Analysis

Document Type

Publication Date

Publication Title

Publisher

First page number:

Last page number:

Abstract

Disciplines

Language

Permissions

Repository Citation

Browse

Links

Digital Scholarship@UNLV

Electrical & Computer Engineering Faculty Research

Name Extraction and Formal Concept Analysis

Authors

Document Type

Publication Date

Publication Title

Publisher

First page number:

Last page number:

Abstract

Disciplines

Language

Permissions

Repository Citation

Share

Browse

Links