📞 +91-7667918914 | âœ‰ī¸ ijarcce@gmail.com
IJARCCE Logo
International Journal of Advanced Research in Computer and Communication Engineering A monthly Peer-reviewed & Refereed journal
ISSN Online 2278-1021ISSN Print 2319-5940Since 2012
IJARCCE adheres to the suggestive parameters outlined by the University Grants Commission (UGC) for peer-reviewed journals, upholding high standards of research quality, ethical publishing, and academic excellence.
← Back to VOLUME 5, ISSUE 4, APRIL 2016

A Framework for Sharing Computations and Data Stores for Big Data Applications within and Across Organizations

Amandeep Gupta, Pritish Mukherjee, Anuj Mahajan, Vishal Pandey

DOI: 10.17148/IJARCCE.2016.54165

Abstract: Current business practices use separate systems to perform computations on data sets which may be from various sources; however, individuals or small organizations may lack this ability. The reuse of intermediate results across further computations is an important class of emerging applications. This paper aims to tackle this issue regarding sharing data between different organizations/applications and thereby optimizing their computations. In addition to sharing large amounts of data, we can share the intermediary/preliminary results from the data pipeline to various organizations. Any organization when handling data involves ETL steps for collecting data, pre-processing and then perform computations on it. If another organization wishes to work with the same data set, it has to repeat the collection, pre-processing and computation process. The proposed system can help organizations/end-users by providing intermediate computation results after performing ETL steps and basic processing on our end. These computation results are available for use by other organizations/individuals for further application specific processing using REST APIs or some other way.



Keywords: shared data store, shared computation, big data, sentiment analysis, image tagging, healthcare.

How to Cite:

[1] Amandeep Gupta, Pritish Mukherjee, Anuj Mahajan, Vishal Pandey, “A Framework for Sharing Computations and Data Stores for Big Data Applications within and Across Organizations,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.54165