What is SequenceFileInputFormat in MapReduce?

Free Online Certification Courses – Learn Today. Lead Tomorrow. Forums Apache Hadoop What is SequenceFileInputFormat in MapReduce?

Viewing 1 reply thread
  • Author
    Posts
    • #5543
      DataFlair TeamDataFlair Team
      Spectator

      What is SequenceFileInputFormat? For what it is used in Hadoop?

    • #5545
      DataFlair TeamDataFlair Team
      Spectator

      SequenceFileInputFormat in Hadoop is an InputFormat which reads sequence files. Sequence files are binary files that stores sequences of binary key-value pairs. Sequence files are block-compressed and provide direct serialization and deserialization of several arbitrary data types (not just text). Here Key & Value both are user-defined.

      Advantages:-

      1) More compact than text files
      2) Files can be split and processed in parallel
      3) It can be used as a container for large number of small files

      Disadvantages:-
      1) Temporary output of Mapper can be stored in sequential files

Viewing 1 reply thread
  • You must be logged in to reply to this topic.