"Study of document clustering using the k-means algorithm" by Meghna Sharma Gummuluru

UNLV Retrospective Theses & Dissertations

Title

Study of document clustering using the k-means algorithm

Author

Meghna Sharma Gummuluru, University of Nevada, Las Vegas

Award Date

1-1-2006

Degree Type

Thesis

Degree Name

Master of Science (MS)

Department

Computer Science

First Committee Member

Kazem Taghva

Number of Pages

Abstract

One of the most commonly used data mining techniques is document clustering or unsupervised document classification which deals with the grouping of documents based on some document similarity function; This thesis deals with research issues associated with categorizing documents using the k-means clustering algorithm which groups objects into K number of groups based on document representations and similarities; The proposed hypothesis of this thesis is to prove that unsupervised clustering of a set of documents produces similar results to that of their supervised categorization.

Keywords

Algorithms; Clustering; Document Means; Study

Controlled Subject

Computer science

File Format

pdf

File Size

1484.8 KB

Degree Grantor

University of Nevada, Las Vegas

Language

English

Permissions

If you are the rightful copyright holder of this dissertation or thesis and wish to have the full text removed from Digital Scholarship@UNLV, please submit a request to digitalscholarship@unlv.edu and include clear identification of the work, preferably with URL.

Repository Citation

Gummuluru, Meghna Sharma, "Study of document clustering using the k-means algorithm" (2006). UNLV Retrospective Theses & Dissertations. 2050.
http://dx.doi.org/10.25669/22cq-7e2w

Rights

IN COPYRIGHT. For more information about this rights statement, please visit http://rightsstatements.org/vocab/InC/1.0/

Download

COinS

Digital Scholarship@UNLV