Academic Journals Database
Disseminating quality controlled scientific knowledge

Comparative Study of K-means and Bisecting k-means Techniques in Wordnet Based Document Clustering

Author(s): B.S.Vamsi Krishna | Sr.Assistant Professor, CSE, MVGR College of Engineering,Chintalavalasa,Vizianagaram,Andhrapradesh,India. | Suneel Kumar R

Journal: International Journal of Engineering and Advanced Technology
ISSN 2249-8958

Volume: 1;
Issue: 6;
Start page: 229;
Date: 2012;
VIEW PDF   PDF DOWNLOAD PDF   Download PDF Original page

Keywords: bisecting k-means | document clustering | standatd k-means | wordnet.

Document clustering plays major role in the fast developing information explosion. It is considered as tool for performing information based operations. Document clustering generates clusters from whole document collection automatically and used in many fields. It is the process of grouping text documents into category groups. It has found applications in various domains in information retrieval and web information systems. Ontology-based computing is considered as a natural evolution of existing technologies to cope with the information onslaught. In current paper, background knowledge derived from Word Net as Ontology is applied during preprocessing of documents for Document Clustering. Document vectors constructed from WordNet Synsets is used as input for clustering. Comparative analysis is done between clustering using k-means and clustering using bi-secting k-means. Results indicate that the bi-secting k-means clustering technique is better than standard k-means clustering technique. These results based on the analysis of specifics of clustering algorithm and nature of document data.
Save time & money - Smart Internet Solutions      Why do you need a reservation system?