Portable Big Data

Photo by ian dooley on Unsplash

In this tutorial we will use Vagrant to start a micro cluster running HDFS and Apache Spark with just 3 GB of RAM, inside a virtual machine provisioned by Vagrant. Best of all, it takes only a few steps and zero configuration.
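To give a rough idea of how a 3 GB memory budget is expressed in Vagrant, here is a minimal Vagrantfile sketch. It is not the tutorial's actual file: the box name `ubuntu/focal64`, the VirtualBox provider, and the CPU count are assumptions made for illustration only.

```ruby
# Minimal Vagrantfile sketch (illustrative, not the tutorial's own file).
Vagrant.configure("2") do |config|
  # Hypothetical base box; the tutorial's real box is not shown here.
  config.vm.box = "ubuntu/focal64"

  # Assumes the VirtualBox provider is the one in use.
  config.vm.provider "virtualbox" do |vb|
    vb.memory = 3072  # 3 GB of RAM, matching the budget mentioned above
    vb.cpus = 2       # assumed value, not stated in the tutorial
  end
end
```

With a file like this in the current directory, `vagrant up` creates and boots the virtual machine and `vagrant ssh` opens a shell inside it.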

To follow this tutorial, you first need to complete the previous one, which installs all the prerequisites.


Father-Husband-Data Scientist-Philosopher-Entrepreneur-Professor PhD c. in Data Science-MSc Stats #R #Scala #Spark #SatelliteImagery #Python #BigData #Nerd

Abel Coronado
