What is Spark SQL?

This topic has 1 reply, 1 voice, and was last updated 5 years, 6 months ago by DataFlair Team.

Author

Posts
- September 20, 2018 at 2:08 pm #5064
  
  DataFlair Team
  Spectator
  
  Spark SQL is Spark's interface for working with structured and semi-structured data, that is, data with defined fields, such as tables. It provides abstraction layers called DataFrame and Dataset, through which we can work with data easily. A DataFrame can be thought of as a table in a relational database. Spark SQL can read and write data in a variety of structured and semi-structured formats, such as Parquet, JSON, and Hive tables. It is most useful when used inside a Spark application, where we can load data and query it with SQL, and also combine those queries with "regular" program code in Python, Java, or Scala.
  
  For a detailed study of Spark SQL, refer to: Spark SQL
- September 20, 2018 at 2:08 pm #5065
  
  DataFlair Team
  Spectator
  
  Spark SQL is a module in Apache Spark for processing structured and semi-structured data. The interfaces provided by Spark SQL give Spark more information about the structure of both the data and the computation being performed on it. It integrates relational processing with Spark's functional programming API, offering much tighter integration of relational and procedural processing through the declarative DataFrame API. The DataFrame and Dataset APIs are the ways to interact with Spark SQL.