Big Data: Exploring Hadoop, MapReduce &amp; HDFS in Data Mining

Prof. Datey Zuhaib Khalil; Prof. M. Jhansi Lakshmi

doi:10.17148/IJARCCE.2016.58113

← Back to VOLUME 5, ISSUE 8, AUGUST 2016

Big Data: Exploring Hadoop, MapReduce & HDFS in Data Mining

Prof. Datey Zuhaib Khalil, Prof. M. Jhansi Lakshmi

DOI: 10.17148/IJARCCE.2016.58113

Abstract: The Global digital content created will increase some 30 times over the next ten years � to 35 zetta bytes, this unstoppable increase in data challenges business problems, a big data represents a large and rapidly growing volume of information that is mostly untapped by existing analytical applications and data warehousing systems. To analyze this enormous amount of data Hadoop can be used. Hadoop is an open source software project that enables the distributed processing of large data sets across clusters of commodity servers. It is designed to scale up from a single server to thousands of machines, with a very high degree of fault tolerance. The technologies used by big data application to handle the massive data are Hadoop, Map Reduce, Apache Hive, No SQL and HPCC. This paper highlights the big data and the new method of hadoop, MapReduce and HDFS to tackle the problem of big data.

Keywords: Data Mining, Big data, Structured data, BI, Big Data analytics, OLAP, EDA, Neural Networks, Hadoop and MapReduce technique, Advantages, Disadvantages.

Downloads: Download PDF|DOI: 10.17148/IJARCCE.2016.58113

How to Cite:

[1] Prof. Datey Zuhaib Khalil, Prof. M. Jhansi Lakshmi, “Big Data: Exploring Hadoop, MapReduce & HDFS in Data Mining,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.58113

Big Data: Exploring Hadoop, MapReduce &amp; HDFS in Data Mining

How to Cite:

Big Data: Exploring Hadoop, MapReduce & HDFS in Data Mining