Which algorithm is used for sorting in MapReduce Hadoop

Free Online Certification Courses – Learn Today. Lead Tomorrow. Forums Apache Hadoop Which algorithm is used for sorting in MapReduce Hadoop

Viewing 1 reply thread
  • Author
    Posts
    • #5980
      DataFlair TeamDataFlair Team
      Spectator

      Which sorting algorithm is used in MapReduce in Hadoop?

    • #5981
      DataFlair TeamDataFlair Team
      Spectator

      Sorting in MapReduce is on Both Mapper Side as well as Reducer side.

      Mapper side(Quick Sort)
      The Mapper process the input key value pair from Record Reader and generates output as per custom Business logic, before writing the data the output is partitioned and sorted by key.

      Reducer side(Merge Sort):
      In Reducer phase the data from the Mapper is again sorted as per the key and sent to Reducer for processing data as per custom business logic.The shuffling and sorting in Reducer phase happens simultaneously.

Viewing 1 reply thread
  • You must be logged in to reply to this topic.