Spark Tutorial: Introduction to BigData Analytics with Apache Spark Part 1
Introduction to BigData Analytics with Apache Spark Part 1
By Fadi Maalouli and R.H.
Spark Overview
Apache Spark, an open source cluster computing system, is growing fast. Apache Spark has a growing ecosystem of libraries and framework to enable advanced data analytics. Apache Spark’s rapid success is due to its power and and ease-of-use. It is more productive and has faster runtime than the typical MapReduce BigData based analytics. Apache Spark provides in-memory, distributed computing. It has APIs in Java, Scala, Python, and R. The Spark Ecosystem is shown below.