Arabic Stemming Without A Root Dictionary

Meeting name

International Conference on Information Technology: Coding and Computing

Document Type

Conference Proceeding

Publication Date

4-4-2005

Abstract

We have implemented a root-extraction stemmer for Arabic which is similar to the Khoja stemmer but without a root dictionary. Our stemmer was found to perform equivalently to the Khoja stemmer as well as so-called "light" stemmers in monolingual document retrieval tasks performed on the Arabic Trec-2001 collection. A root dictionary, therefore, does not improve Arabic monolingual document retrieval.

Keywords

Arabic Trec-2001 collection; Arabic monolingual document retrieval; Arabic language; Arabic stemming; Dictionaries; Information retrieval; Khoja stemmer; Light stemmers; Natural language processing (Computer science); Natural languages; Root dictionary; Root-extraction stemmer

Disciplines

Computer Engineering | Computer Sciences | Electrical and Computer Engineering | Theory and Algorithms

Permissions

Use Find in Your Library, contact the author, or interlibrary loan to garner a copy of the item. Publisher policy does not allow archiving the final published version. If a post-print (author's peer-reviewed manuscript) is allowed and available, or publisher policy changes, the item will be deposited.

UNLV article access

Share

COinS