What is the Parquet file format in Spark?

  • Author
    Posts
    • #4969
      DataFlair Team
      Moderator

      What is the Parquet file format? Where should the Parquet format be used? How do you convert data to the Parquet format?

    • #4970
      DataFlair Team
      Moderator

      Parquet is a columnar data storage format and one of the best choices for storing large data sets long term for analytics purposes. Spark can perform both read and write operations on Parquet files.

      Parquet was created to make the benefits of a compressed, efficient columnar data representation available to any project, regardless of the choice of data processing framework, data model, or programming language.
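
To give a rough feel for why a columnar layout suits analytics and compresses well, here is a toy sketch in plain Python. It is not Parquet's actual encoding, which is more involved; the records and helper names are hypothetical.

```python
from itertools import groupby

# Hypothetical records, as a row-oriented store would hold them.
rows = [
    {"country": "IN", "year": 2016, "sales": 10},
    {"country": "IN", "year": 2016, "sales": 12},
    {"country": "IN", "year": 2017, "sales": 9},
    {"country": "US", "year": 2017, "sales": 20},
    {"country": "US", "year": 2017, "sales": 25},
]


def to_columns(records):
    """Pivot a list of row dicts into one list per column."""
    return {name: [r[name] for r in records] for name in records[0]}


def run_length_encode(values):
    """Collapse consecutive repeats into (value, count) pairs."""
    return [(value, len(list(group))) for value, group in groupby(values)]


columns = to_columns(rows)

# An aggregate over one column only needs that column's values.
total_sales = sum(columns["sales"])

# Values of the same column sit together, so repeats collapse easily.
encoded_country = run_length_encode(columns["country"])
```

Here a query such as "total sales" reads a single list instead of every full record, and the repetitive `country` column shrinks to two pairs.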

      Parquet is a format that can be processed by a number of different systems: Spark SQL, Impala, Hive, Pig, Scrooge, etc. It does not lock you into a particular programming language, since the format is defined using Thrift, which supports numerous programming languages. For example, Impala is written in C++ whereas Hive is written in Java, yet they can easily interoperate on the same Parquet data.
