Autotag: A Tool for Creating Structured Document Collections from Printed Materials
Editors
Jacques André; Heather Brown; Roger D. Hersch
Document Type
Chapter
Publication Date
1998
Publication Title
Electronic Publishing, Artistic Imaging, and Digital Typography
Publisher
Springer Berlin Heidelberg
First Page Number
420
Last Page Number
431
Abstract
We report on the design and implementation of a system that automates the capture of structured documents from the optically recognized form of printed materials. The system is intended to convert printed collections into their corresponding tagged electronic versions with little or no manual intervention. This conversion process poses some unique problems; we discuss them, along with our attempts to solve them. The system also establishes a mapping between the bitmap image and its corresponding ASCII representation that can be used to design flexible image-based interfaces for information retrieval (IR) applications.
Disciplines
Electrical and Computer Engineering | Engineering
Language
English
Permissions
Use Find in Your Library, contact the author, or request an interlibrary loan to obtain a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or if publisher policy changes, the item will be deposited.
Repository Citation
Taghva, K., Condit, A., Borsack, J. (1998). Autotag: A Tool for Creating Structured Document Collections from Printed Materials. In Jacques André; Heather Brown; Roger D. Hersch (Eds.), Electronic Publishing, Artistic Imaging, and Digital Typography, 420-431. Springer Berlin Heidelberg.