International Journal of Advanced Research in Computer and Communication Engineering

A monthly peer-reviewed online and print journal

ISSN Online 2278-1021
ISSN Print 2319-5940

Abstract: Web mining is the process of applying algorithms to discover and analyse patterns in data from the World Wide Web. It comprises web content mining, web structure mining, and web usage mining. In this paper we extract data from the IMDb website, a leading online guide to movies, music, TV shows, and celebrity news. We have written an algorithm that uses a web scraper bot to extract the records for a particular year, name, or IMDb rating within a few seconds and to output them in CSV format. Once the crude data has been extracted from the website and converted to a useful format, further calculations and analyses can be carried out on it. The data is also stored in both processed and unprocessed form for future applications.

Keywords: Pattern discovery on the World Wide Web, data extraction from the IMDb website, conversion of crude data to a useful format, CSV format, storage of processed and unprocessed data
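
The filter-and-export step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' scraper: it assumes the pages have already been fetched and parsed into rows, and the names `RAW_ROWS`, `clean`, `select`, and `to_csv` are hypothetical, as are the placeholder records (they are not real IMDb data). It shows only how crude string fields might be converted to typed values, filtered by year, name, or rating, and written out as CSV in both unprocessed and processed form.

```python
import csv
import io

# Hypothetical rows standing in for already-scraped data.
# Placeholder values only; these are not real IMDb records.
RAW_ROWS = [
    {"title": "Example Film A", "year": "2016", "rating": "7.9"},
    {"title": "Example Film B", "year": "2017", "rating": "6.4"},
    {"title": "Example Film C", "year": "2017", "rating": "8.2"},
]


def clean(row):
    """Convert crude string fields into typed values."""
    return {
        "title": row["title"].strip(),
        "year": int(row["year"]),
        "rating": float(row["rating"]),
    }


def select(rows, year=None, name=None, min_rating=None):
    """Keep rows matching every criterion that is not None."""
    matches = []
    for row in rows:
        if year is not None and row["year"] != year:
            continue
        if name is not None and name.lower() not in row["title"].lower():
            continue
        if min_rating is not None and row["rating"] < min_rating:
            continue
        matches.append(row)
    return matches


def to_csv(rows, fieldnames=("title", "year", "rating")):
    """Serialise rows to CSV text with a header line."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


processed = [clean(row) for row in RAW_ROWS]

# Keep both forms, as the abstract describes: unprocessed and processed.
raw_csv = to_csv(RAW_ROWS)
processed_csv = to_csv(select(processed, year=2017, min_rating=7.0))
print(processed_csv)
```

Keeping the unprocessed rows alongside the cleaned ones means later analyses can start again from the crude data if the cleaning rules change.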


DOI: 10.17148/IJARCCE.2018.7717