This video tutorial explains the basics of Apache Spark. This quick start walks you through Apache Spark, Spark terminology, RDDs, transformations, actions, the internals of the Spark architecture, and the basics of Spark Streaming and Spark SQL.
Below is the list of topics covered in the tutorial:
- What and why of Big Data
- Big Data solutions currently available
- Comparison of different Big Data technologies
- Limitations of MapReduce
- The need for Apache Spark – a general-purpose cluster computing engine
- Introduction to Apache Spark
- Spark features and internals of Spark architecture
- Spark ecosystem – Spark Core, Spark Streaming, Spark SQL, MLlib, GraphX
- Spark abstraction – RDDs
- Transformations and actions
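
To give a feel for the difference between transformations and actions before watching, here is a minimal sketch in plain Python. It is a toy model, not Spark's API: the `ToyRDD` class, the `square` helper, and the `calls` list are all hypothetical names introduced for illustration. It assumes only the general idea that transformations are lazy (they describe a computation) while actions trigger the computation and return a result.

```python
from typing import Any, Callable, Iterable, Iterator, List


class ToyRDD:
    """Toy stand-in for an RDD. NOT Spark's API; it only illustrates laziness."""

    def __init__(self, source: Callable[[], Iterator[Any]]) -> None:
        # Store a recipe that produces the elements, not the elements themselves.
        self._source = source

    @classmethod
    def from_list(cls, data: Iterable[Any]) -> "ToyRDD":
        items = list(data)
        return cls(lambda: iter(items))

    # Transformations: return a new ToyRDD and compute nothing yet.
    def map(self, fn: Callable[[Any], Any]) -> "ToyRDD":
        return ToyRDD(lambda: (fn(x) for x in self._source()))

    def filter(self, pred: Callable[[Any], bool]) -> "ToyRDD":
        return ToyRDD(lambda: (x for x in self._source() if pred(x)))

    # Actions: run the whole chain and return a concrete result.
    def collect(self) -> List[Any]:
        return list(self._source())

    def count(self) -> int:
        return sum(1 for _ in self._source())


calls: List[int] = []  # records every time `square` actually runs


def square(x: int) -> int:
    calls.append(x)
    return x * x


numbers = ToyRDD.from_list([1, 2, 3, 4, 5])

# Chaining transformations only builds up the recipe.
evens_squared = numbers.map(square).filter(lambda v: v % 2 == 0)
calls_before_action = len(calls)  # still 0: nothing has been computed

# The action triggers the computation.
result = evens_squared.collect()
print(calls_before_action, result)
```

In PySpark the corresponding RDD methods carry the same names (`map`, `filter`, `collect`, `count`), but the real engine additionally partitions the data and distributes the work across a cluster, which this sketch makes no attempt to model.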