Master of Science (MS)
In this thesis, we report on our experiments on training and categorization of optically recognized documents. In particular, we present a lexicon-based error correction algorithm to improve the categorization process. This algorithm is based on edit distance techniques and information from highly weighted words in the categorizers.
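The abstract does not spell out the algorithm, so the following is only a minimal sketch of how a lexicon-based correction using edit distance might look. It assumes the lexicon is a mapping from a categorizer's highly weighted words to their weights, and that ties in edit distance are broken in favor of the higher weight. The names `edit_distance`, `correct_token`, and `max_distance`, as well as the sample lexicon, are hypothetical and are not taken from the thesis.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance computed with a rolling one-row dynamic program."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute ca -> cb
        prev = curr
    return prev[-1]


def correct_token(token: str, lexicon: dict, max_distance: int = 1) -> str:
    """Replace an OCR token with the closest lexicon word, if one is close enough.

    `lexicon` maps words to weights (assumed here to be categorizer term
    weights). Tokens already in the lexicon, or with no candidate within
    `max_distance`, are returned unchanged.
    """
    if token in lexicon:
        return token
    best_word, best_key = None, None
    for word, weight in lexicon.items():
        # A length gap larger than max_distance already rules the word out.
        if abs(len(word) - len(token)) > max_distance:
            continue
        distance = edit_distance(token, word)
        if distance > max_distance:
            continue
        key = (distance, -weight)  # closer first, then higher weight
        if best_key is None or key < best_key:
            best_word, best_key = word, key
    return best_word if best_word is not None else token


# Hypothetical lexicon of highly weighted terms for one category.
lexicon = {"market": 2.5, "marker": 0.7, "trade": 1.9}
print(correct_token("rnarket", lexicon, max_distance=2))
print(correct_token("markel", lexicon))
```

In this sketch, restricting candidates to highly weighted words keeps the lexicon small and targets the corrections that matter most to the categorizer; a different threshold or tie-breaking rule could be substituted without changing the overall shape.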
Categorization; Effects; Errors; OCR; Text
University of Nevada, Las Vegas
Mackovski, Lidija K, "Effects of OCR errors on text categorization" (2001). UNLV Retrospective Theses & Dissertations. 1331.