Master of Science (MS)
First Committee Member
Number of Pages
One of the most commonly used data mining techniques is document clustering or unsupervised document classification which deals with the grouping of documents based on some document similarity function; This thesis deals with research issues associated with categorizing documents using the k-means clustering algorithm which groups objects into K number of groups based on document representations and similarities; The proposed hypothesis of this thesis is to prove that unsupervised clustering of a set of documents produces similar results to that of their supervised categorization.
Algorithms; Clustering; Document Means; Study
University of Nevada, Las Vegas
If you are the rightful copyright holder of this dissertation or thesis and wish to have the full text removed from Digital Scholarship@UNLV, please submit a request to email@example.com and include clear identification of the work, preferably with URL.
Gummuluru, Meghna Sharma, "Study of document clustering using the k-means algorithm" (2006). UNLV Retrospective Theses & Dissertations. 2050.