Autotag: A Tool for Creating Structured Document Collections from Printed Materials

Editors

Jacques André; Heather Brown; Roger D. Hersch

Document Type

Chapter

Publication Date

1998

Publication Title

Electronic Publishing, Artistic Imaging, and Digital Typography

Publisher

Springer Berlin Heidelberg

First page number:

420

Last page number:

431

Abstract

We report on the design and implementation of a system which automates the process of capturing structured documents from the optically recognized form of printed materials. The system is intended to be used to convert printed collections into their corresponding tagged electronic versions with little or no manual interventon. This conversion process has some unique problems associated with it, these are discussed, along with our attempts to solve them. This system also establishes a mapping between the bitmap image and its corresponding ASCII representation that can be used to design flexible image-based interfaces for IR-related applications.

Disciplines

Electrical and Computer Engineering | Engineering

Language

English

Permissions

Use Find in Your Library, contact the author, or interlibrary loan to garner a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.

UNLV article access

Share

COinS