📞 +91-7667918914 | ✉️ ijarcce@gmail.com
IJARCCE Logo
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 5, ISSUE 7, JULY 2016

Hadoop-GPU Based K-Means for Data Clustering

Parag Pachouri, Prof. Manaswini Panigrahi

DOI: 10.17148/IJARCCE.2016.57103

Abstract: For achieving data parallelism in Apache Hadoop, MapReduce is the most prominent programming model. Lots of efforts are attempting for boost the computational speed of MapReduce in Hadoop framework. In this paper, we present a MapReduce programming model focused on the Kmeans clustering algorithms that leverage the acceleration potential of the integrated GPU in a multi-node cluster environment. It accelerated the framework by providing intra parallelism between the MapReduce function by using modified k-means algorithm. Based on various experiments on multi node cluster and depth analysis, we find that utilizing of the integrated GPU via OpenCL offers significant performance and power efficiency gains over the original CPU based or sequential approaches.



Keywords: Hadoop, Map/Reduce, OpenCL and KMeans.

How to Cite:

[1] Parag Pachouri, Prof. Manaswini Panigrahi, “Hadoop-GPU Based K-Means for Data Clustering,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.57103