Apache Spark single-node installation

Apache Spark is an open-source cluster computing framework originally developed in the AMPLab at UC Berkeley. In contrast to Hadoop's two-stage, disk-based MapReduce paradigm, Spark's in-memory primitives can make certain applications up to 100 times faster.

Step 1 Download the latest Spark release

Step 2 Download the latest Scala release

Step 3 Download the Java JDK

NOTE Install git first: open a terminal and run sudo apt-get install git

Step 4 Untar Spark, Scala, and the JDK
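
A minimal sketch of this step, assuming all three downloads are gzip-compressed tarballs. The helper name extract_all and the target directory are illustrative and not part of the original walkthrough; the archive names are left as placeholders because they depend on the versions you downloaded.

```shell
# Sketch only: extract every archive given as an argument into one directory.
# extract_all and the directory layout are hypothetical, chosen for illustration.
extract_all() {
  dest="$1"
  shift
  mkdir -p "$dest" || return 1
  for archive in "$@"; do
    # -x extract, -z gunzip, -f read this file, -C switch to dest before extracting
    tar -xzf "$archive" -C "$dest" || return 1
  done
}

# Example call; replace the placeholders with the files you downloaded:
#   extract_all "$HOME/spark-stack" <spark archive>.tgz <scala archive>.tgz <jdk archive>.tar.gz
```

Keeping all three extracted directories under one parent makes the paths in the next step easier to write.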

Step 5 Set the environment paths in .bashrc
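
The lines added to .bashrc might look roughly like the following. The install directories are assumptions made for illustration; point each variable at wherever the archives were actually extracted in Step 4.

```shell
# Lines to append to ~/.bashrc. The directories below are assumed locations,
# not paths taken from the original post; adjust them to your own layout.
export JAVA_HOME="$HOME/spark-stack/jdk"
export SCALA_HOME="$HOME/spark-stack/scala"
export SPARK_HOME="$HOME/spark-stack/spark"

# Put the three bin directories ahead of the existing PATH entries.
export PATH="$JAVA_HOME/bin:$SCALA_HOME/bin:$SPARK_HOME/bin:$PATH"
```

After saving the file, run source ~/.bashrc (or open a new terminal) so the variables take effect, then check them with echo $SPARK_HOME.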

Step 6 Build Spark using sbt
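
A hedged sketch of the build step. Spark source trees ship their own sbt launcher script, but its location and the task to run depend on the release: older trees keep it at sbt/sbt and newer ones at build/sbt, and the documented task has been assembly in older releases and package in newer ones. The helper names find_sbt_launcher and build_spark are hypothetical; check the build instructions for your version before relying on either task name.

```shell
# Print the path of the sbt launcher bundled with a Spark source tree, if any.
find_sbt_launcher() {
  spark_dir="$1"
  for candidate in build/sbt sbt/sbt; do
    if [ -x "$spark_dir/$candidate" ]; then
      echo "$spark_dir/$candidate"
      return 0
    fi
  done
  return 1
}

# Run one sbt task (for example "assembly" or "package") from the source root.
build_spark() {
  spark_dir="$1"
  task="$2"
  launcher=$(find_sbt_launcher "$spark_dir") || {
    echo "no sbt launcher found under $spark_dir" >&2
    return 1
  }
  # Run from the source root so sbt picks up Spark's build definition.
  (cd "$spark_dir" && "$launcher" "$task")
}

# Example call, assuming SPARK_HOME is an absolute path to the source tree:
#   build_spark "$SPARK_HOME" package
```

The first build downloads many dependencies, so it can take a while and needs a working internet connection.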

Step 7 Start the Spark shell
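
One way to launch the shell, sketched as a small wrapper that first checks the environment from Step 5. The function name start_spark_shell is hypothetical; bin/spark-shell is the launcher script that ships with Spark.

```shell
# Start the interactive Spark shell, passing any extra arguments through.
start_spark_shell() {
  if [ -z "${SPARK_HOME:-}" ]; then
    echo "SPARK_HOME is not set; see Step 5" >&2
    return 1
  fi
  if [ ! -x "$SPARK_HOME/bin/spark-shell" ]; then
    echo "no executable spark-shell under $SPARK_HOME/bin" >&2
    return 1
  fi
  "$SPARK_HOME/bin/spark-shell" "$@"
}

# Example call, running Spark locally with two worker threads:
#   start_spark_shell --master "local[2]"
```

Once the scala> prompt appears, the shell has already created a SparkContext and made it available as sc.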
