What is SparkContext in Apache Spark?

This topic has 1 reply, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.

Viewing 1 reply thread

Author

Posts
- September 20, 2018 at 3:52 pm #5656
  
  DataFlair Team
  Spectator
  
  What is SparkContext in Apache Spark?
  What is the need of SparkContext?
  What are the responsibilities of SparkContext?
- September 20, 2018 at 3:52 pm #5657
  
  DataFlair Team
  Spectator
  
  A SparkContext is a client of Spark’s execution environment and it acts as the master of the Spark application. SparkContext sets up internal services and establishes a connection to a Spark execution environment. You can create RDDs, accumulators and broadcast variables, access Spark services and run jobs (until SparkContext stops) after the creation of SparkContext. Only one SparkContext may be active per JVM. You must stop() the active SparkContext before creating a new one.
  
  In Spark shell, a special interpreter-aware SparkContext is already created for the user, in the variable called sc.
  
  The first step of any Spark driver application is to create a SparkContext. The SparkContext allows the Spark driver application to access the cluster through a resource manager. The resource manager can be YARN, or Spark’s Cluster Manager.
  
  Few functionalities which SparkContext offers are:
  1. We can get the current status of a Spark application like configuration, app name.
  2. We can set Configuration like master URL, default logging level.
  3. One can create Distributed Entities like RDDs.
  
  For detailed knowledge on SparkContext read SparkContext In Apache Spark.
Author

Posts

Viewing 1 reply thread

You must be logged in to reply to this topic.

What is SparkContext in Apache Spark?

About DataFlair

Trending Data Science Courses

Free Big Data Courses

Trending Programming Courses

Trending Web Dev Courses

Trending Courses

Trending Python Courses

Trending Java Courses

Trending DSA Courses