What is the identity reducer?

This topic has 2 replies, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.

Author

Posts
- September 20, 2018 at 5:12 pm #6140
  
  DataFlair Team
  Spectator
  
  What is the identity reducer in MapReduce?
- September 20, 2018 at 5:12 pm #6143
  
  DataFlair Team
  Spectator
  
  IdentityReducer is the default reducer in Hadoop's old API (the org.apache.hadoop.mapred package). When the driver class does not set a reducer class with setReducerClass() on the job configuration, IdentityReducer is used as the default.
  
  It does no processing on its input: every key-value pair it receives is written to the output unchanged.
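  
  To make the pass-through behaviour concrete, here is a minimal plain-Java sketch of what an identity reduce step does. It deliberately avoids the Hadoop libraries so it can run on its own: the Collector interface and the IdentityReduceSketch class are hypothetical stand-ins made up for illustration, not Hadoop types.
  
  ```java
  import java.util.ArrayList;
  import java.util.Iterator;
  import java.util.List;

  // Plain-Java model of an identity reducer; no Hadoop dependency.
  class IdentityReduceSketch {

      // Hypothetical stand-in for the output collector a reducer writes to.
      interface Collector<K, V> {
          void collect(K key, V value);
      }

      // Emits every (key, value) pair exactly as received, with no aggregation.
      static <K, V> void reduce(K key, Iterator<V> values, Collector<K, V> out) {
          while (values.hasNext()) {
              out.collect(key, values.next());
          }
      }

      // Convenience wrapper: returns the emitted pairs as "key<TAB>value" strings.
      static <K, V> List<String> run(K key, List<V> values) {
          List<String> emitted = new ArrayList<>();
          reduce(key, values.iterator(), (k, v) -> emitted.add(k + "\t" + v));
          return emitted;
      }

      public static void main(String[] args) {
          // Three values for one key go in; the same three records come out.
          run("hadoop", List.of(1, 1, 1)).forEach(System.out::println);
      }
  }
  ```
  
  A summing reducer would collapse the three values for "hadoop" into a single record; the identity version keeps one output record per input value.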
- September 20, 2018 at 5:12 pm #6145
  
  DataFlair Team
  Spectator
  
  IdentityReducer is one of the predefined classes that ship with Hadoop.
  It lives in the org.apache.hadoop.mapred.lib package.
  It is used by default when the driver class of a MapReduce job does not specify a reducer class.
  The input key-value pairs are written to the output as they are, without any aggregation; the only change is that the records come out sorted by key within each reduce task's output.
  In other words, shuffle and sort still take place, but aggregation does not. This makes the identity reducer useful when all you want is to sort the map output by key.
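  
  The sort-only effect can be illustrated with a small plain-Java model of a single reduce task. This is a sketch under stated assumptions, not Hadoop code: SortOnlyJobSketch and its methods are hypothetical names, a TreeMap stands in for the shuffle and sort phase, and the order of values under one key simply follows insertion order here, which a real job does not guarantee.
  
  ```java
  import java.util.ArrayList;
  import java.util.List;
  import java.util.Map;
  import java.util.TreeMap;

  // Plain-Java model of a job that uses an identity reducer only to sort by key.
  class SortOnlyJobSketch {

      // Models shuffle and sort: group map output by key, with keys in sorted order.
      static Map<String, List<Integer>> shuffleAndSort(List<Map.Entry<String, Integer>> mapOutput) {
          Map<String, List<Integer>> grouped = new TreeMap<>();
          for (Map.Entry<String, Integer> pair : mapOutput) {
              grouped.computeIfAbsent(pair.getKey(), k -> new ArrayList<>()).add(pair.getValue());
          }
          return grouped;
      }

      // Models the identity reduce phase: one output record per input value.
      static List<String> identityReduce(Map<String, List<Integer>> grouped) {
          List<String> output = new ArrayList<>();
          grouped.forEach((key, values) -> values.forEach(v -> output.add(key + "\t" + v)));
          return output;
      }

      public static void main(String[] args) {
          List<Map.Entry<String, Integer>> mapOutput = List.of(
                  Map.entry("pig", 1), Map.entry("hive", 1),
                  Map.entry("pig", 1), Map.entry("hadoop", 1));
          // Four records in, four records out, now ordered by key.
          identityReduce(shuffleAndSort(mapOutput)).forEach(System.out::println);
      }
  }
  ```
  
  The record count is the same before and after; only the order changes, which is the whole point of running a job with the identity reducer.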