International Journal of Advanced Research in Computer and Communication Engineering

A monthly peer-reviewed online and print journal

ISSN Online 2278-1021
ISSN Print 2319-5940

Abstract: Mining of data from large data sets and the process of discovering patterns using statistics, machine learning, data correlation, data plotting or data visualization and data evaluation are called data mining. Data analytics and data mining are a subset of Business Intelligence (BI). [1] In our previous paper titled “Data Analytics: Employee Turnover in a Company-1” the process of data pre-processing was demonstrated by writing a program in Python. Libraries like pandas, numpy, seaborn and matplotlib [2] of Python provide platform for computing, evaluation and visualization of acquired data. In this paper we demonstrate three analytical tools- plotting and evaluating, correlation and data prediction/Machine learning which are involved in data mining and analytics of company’s data. The company wants to understand the factors contributing to employee turnover and to think of various retention strategies.

Keywords: Python, analytical tools- plotting and evaluating, correlation and data prediction/Machine learning


PDF | DOI: 10.17148/IJARCCE.2018.71112