What is SequenceFileInputFormat in MapReduce?
-
-
What is SequenceFileInputFormat? For what it is used in Hadoop?
-
SequenceFileInputFormat in Hadoop is an InputFormat which reads sequence files. Sequence files are binary files that stores sequences of binary key-value pairs. Sequence files are block-compressed and provide direct serialization and deserialization of several arbitrary data types (not just text). Here Key & Value both are user-defined.
Advantages:-
1) More compact than text files
2) Files can be split and processed in parallel
3) It can be used as a container for large number of small files
Disadvantages:-
1) Temporary output of Mapper can be stored in sequential files
- You must be logged in to reply to this topic.