Difference between Job and Task in Hadoop MapReduce

This topic has 1 reply, 1 voice, and was last updated 7 years, 10 months ago by DataFlair Team.

Viewing 1 reply thread

Author

Posts
- September 20, 2018 at 2:39 pm #5238
  
  DataFlair Team
  Spectator
  
  What is the difference between a job and a task in Hadoop?
  How does a MapReduce job compare with a task?
- September 20, 2018 at 2:39 pm #5241
  
  DataFlair Team
  Spectator
  
  MapReduce is the data processing layer of Hadoop. It is the framework for writing applications that process vast amounts of data stored in HDFS.
  In Hadoop, a job is the complete unit of work submitted by the user, and it is divided into multiple smaller parts known as tasks.
  
  A MapReduce job splits the input dataset into independent chunks, which are processed by the map tasks in a completely parallel manner. The Hadoop framework sorts the outputs of the map tasks, which then become the input to the reduce tasks.
  Both the input and the output of the job are stored in a filesystem. The Hadoop framework takes care of scheduling tasks, monitoring them, and re-executing failed tasks.
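
To make the job vs. task relationship concrete, here is a minimal sketch in plain Python. It does not use Hadoop or its API: `run_job`, `map_task`, `reduce_task`, and `split_size` are hypothetical names chosen for illustration, the word-count logic is an assumed example workload, and a thread pool stands in for the cluster. It only mimics the flow described above: one job, split into chunks, one map task per chunk, a sort by key, then reduce tasks.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor


def map_task(split):
    """One map task: emit a (word, 1) pair for every word in its split."""
    return [(word.lower(), 1) for line in split for word in line.split()]


def reduce_task(key, values):
    """One reduce task: sum the counts collected for a single key."""
    return key, sum(values)


def run_job(lines, split_size=2):
    """One job: split the input, run map tasks, sort, then run reduce tasks."""
    # The job splits the input dataset into independent chunks.
    splits = [lines[i:i + split_size] for i in range(0, len(lines), split_size)]

    # Each chunk is handled by its own map task; the tasks run in parallel.
    # (A thread pool is only a stand-in for a cluster here.)
    with ThreadPoolExecutor() as pool:
        map_outputs = list(pool.map(map_task, splits))

    # Group the map output by key, as the framework would before reducing.
    grouped = defaultdict(list)
    for output in map_outputs:
        for key, value in output:
            grouped[key].append(value)

    # The groups, visited in sorted key order, are the input to the reduce tasks.
    results = dict(reduce_task(key, grouped[key]) for key in sorted(grouped))
    return results, len(splits)


lines = [
    "hadoop runs a job",
    "a job has many tasks",
    "tasks run in parallel",
]
counts, num_map_tasks = run_job(lines)
print(num_map_tasks, "map tasks")
print(counts)
```

With three input lines and a split size of two, the single job produces two map tasks, and each distinct word gets its own reduce call. In real Hadoop the number of map tasks is driven by the input splits and the number of reduce tasks is configured per job, so treat the one-reduce-call-per-key shape here as a simplification.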