What is TextInputFormat in MapReduce?

Viewing 2 reply threads
  • Author
    Posts
    • #5163
      DataFlair TeamDataFlair Team
      Spectator

      What is TextInputFormat? For what it is used in MapReduce?
      What is TextInputFormat in Hadoop MapReduce?

    • #5164
      DataFlair TeamDataFlair Team
      Spectator

      InputFormat defines the splits of input file it is the first component of Map-Reduce,
      since Map task depends on the data , InputFormat defines the size of the Map task and also the defines the RecordReader

      TextInputFormat is the default InputFormat, its for the plain text files, where each line is considered as records.
      key- Byte offset
      value – Rest of the line

    • #5166
      DataFlair TeamDataFlair Team
      Spectator

      Input Format defines how input file is going to be read and split up.So TextInputFormat is the default InputFormat in which each line is treated as a Separate record.key (a Long Writable) is the byte offset within the file of the beginning of the line and value is the contents of the line( excluding any line terminators).

Viewing 2 reply threads
  • You must be logged in to reply to this topic.