SciELO - Scientific Electronic Library Online

 
vol.24 número2On Detecting Keywords for Concept Mapping in Plain TextAn LSTM Based Time Series Forecasting Framework for Web Services Recommendation índice de autoresíndice de assuntospesquisa de artigos
Home Pagelista alfabética de periódicos  

Serviços Personalizados

Journal

Artigo

Indicadores

Links relacionados

  • Não possue artigos similaresSimilares em SciELO

Compartilhar


Computación y Sistemas

versão On-line ISSN 2007-9737versão impressa ISSN 1405-5546

Resumo

KETU, Shwet; MISHRA, Pramod Kumar  e  AGARWAL, Sonali. Performance Analysis of Distributed Computing Frameworks for Big Data Analytics: Hadoop Vs Spark. Comp. y Sist. [online]. 2020, vol.24, n.2, pp.669-686.  Epub 04-Out-2021. ISSN 2007-9737.  https://doi.org/10.13053/cys-24-2-3401.

In the last one decade, the tremendous growth in data emphasizes big data storage and management issues with the highest priorities. For providing better support to software developers for dealing with big data problems, new programming platforms are continuously developing and Hadoop MapReduce is a big game-changer followed by Spark, which sets the world of big data on fire with its processing speed and comfortable APIs. Hadoop framework emerged as a leading tool based on the MapReduce programming model with a distributed file system. Spark is on the other hand, recently developed big data analysis and management framework used to explore unlimited underlying features of Big Data. In this research work, a comparative analysis of Hadoop MapReduce and Spark has been presented based on working principle, performance, cost, ease of use, compatibility, data processing, failure tolerance, and security. Experimental analysis has been performed to observe the performance of Hadoop MapReduce and Spark for establishing their suitability under different constraints of the distributed computing environment.

Palavras-chave : Big data; parallel processing; distributed environments; distributed frameworks; Hadoop MapReduce; Spark; big data analytics.

        · texto em Inglês     · Inglês ( pdf )