Una breve aproximación a Spark
Date
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
With the appearance of the internet and mobile devices, as well as an increase in the number of sensors to control a very diverse number of devices, the companies and users that manipulate this data were lacking a tool to analyze them, so companies As Google, they began to develop tools to analyze the large volume of information they generated using distributed computing. In recent years one of the most popular tools has been Apache Spark, which, while retaining the good characteristics of its predecessors and with new readjustments and concepts, has improved performance during the computations. This work briefly describes the characteristics of the Spark framework, details how it is installed, and then shows some examples of its use.