This topic contains 1 reply, has 1 voice, and was last updated by  dfbdteam3 1 year, 6 months ago.

Viewing 2 posts - 1 through 2 (of 2 total)
  • Author
    Posts
  • #5232

    dfbdteam3
    Moderator

    How much space the split will occupy?

    #5234

    dfbdteam3
    Moderator

    Input split is the logical representation of Block size. One split is usually equal to one block. But block size and split size can be customized. InputFormat.getSplitts() is responsible for creating the splits which are used as input my Mapper. One mapper can process one split at a time.
    Consider block size of 128 Mb. Suppose a large file comes and is broken into blocks. One block can end in between a line but the split will consider this and it ends at the completion of the previous line.

    To learn more in detail follow: Input split in Hadoop

Viewing 2 posts - 1 through 2 (of 2 total)

You must be logged in to reply to this topic.