International Journal of Advanced Research in Computer and Communication Engineering

A monthly peer-reviewed online and print journal

ISSN Online 2278-1021
ISSN Print 2319-5940

Abstract: Data mining could be a method of extracting desired and helpful data from the pool of information. Cluster in data processing is that the grouping of information points with some common similarity. Cluster is a vital aspect of information mining. It simply clusters the information sets into given no. of clusters. Various no. of ways are used for the information cluster among that K- suggests that is that the most generally used cluster formula. During this paper we've briefed within the kind of a review work done by completely different researcher’s victimization K-means cluster formula. As a partition primarily based cluster algorithmic program, K-Means is wide employed in several areas for the options of its efficiency and simply understood. However, it's documented that the K-Means algorithmic program could get suboptimal solutions, looking on the selection of the initial cluster centers. During this paper, they propose a projection-based K-Means low-level formatting formula. The planned formula initial use standard mathematician kernel density estimation technique to search out the extremely density information areas in one dimension. Then the projection step is to iteratively use density estimation from the lower variance dimensions to the upper variance ones till all the scale square measure computed. Experiments on actual datasets show that our technique will get similar results compared with different standard ways with fewer computation tasks.

Keywords: Data Mining, Data Sets, Clustering, Clustering Method, K-means Clustering, Unsupervised Learning


PDF | DOI: 10.17148/IJARCCE.2019.8323