Una breve aproximación a Spark

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

With the appearance of the internet and mobile devices, as well as an increase in the number of sensors to control a very diverse number of devices, the companies and users that manipulate this data were lacking a tool to analyze them, so companies As Google, they began to develop tools to analyze the large volume of information they generated using distributed computing. In recent years one of the most popular tools has been Apache Spark, which, while retaining the good characteristics of its predecessors and with new readjustments and concepts, has improved performance during the computations. This work briefly describes the characteristics of the Spark framework, details how it is installed, and then shows some examples of its use.

Description

Citation

DOI