Forums › Apache Hadoop › Why does Hadoop need classes like Text or IntWritable?
This topic has 2 replies, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.
September 20, 2018 at 4:39 pm · #5905 · DataFlair Team (Spectator)
In MapReduce, why does Hadoop need classes like Text or IntWritable instead of String or Integer? Why can't we use the default Java types?
September 20, 2018 at 4:39 pm · #5907 · DataFlair Team (Spectator)
MapReduce uses special-purpose datatypes for its keys and values: IntWritable instead of int, LongWritable instead of long, Text instead of String.
These keys and values have to travel across the network (from mapper nodes to reducer nodes), so Hadoop defines datatypes that carry their own serialization. Since this is a physical movement of data, it must be efficient. We can also create custom key and value types:
Key class must implement the WritableComparable interface (and its abstract methods)
Value class must implement the Writable interface (and its abstract methods)
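As a sketch of what a custom key type looks like, the example below defines a hypothetical IntPairWritable. The Writable and WritableComparable interfaces are declared locally here (mirroring Hadoop's org.apache.hadoop.io versions) so the sketch compiles without Hadoop on the classpath; the toBytes/fromBytes helpers are only for the round-trip demo and are not part of Hadoop's contract.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;

// Minimal stand-ins for Hadoop's org.apache.hadoop.io interfaces,
// declared locally so this sketch runs without Hadoop jars.
interface Writable {
    void write(DataOutput out) throws IOException;     // serialize the fields
    void readFields(DataInput in) throws IOException;  // deserialize the fields
}

interface WritableComparable<T> extends Writable, Comparable<T> {}

// A hypothetical custom key holding two ints.
class IntPairWritable implements WritableComparable<IntPairWritable> {
    private int first;
    private int second;

    public IntPairWritable() {}  // Hadoop instantiates keys reflectively, so a no-arg constructor is required
    public IntPairWritable(int first, int second) { this.first = first; this.second = second; }

    @Override
    public void write(DataOutput out) throws IOException {
        out.writeInt(first);   // a fixed, compact binary layout: 8 bytes total
        out.writeInt(second);
    }

    @Override
    public void readFields(DataInput in) throws IOException {
        first = in.readInt();  // must read fields back in the same order they were written
        second = in.readInt();
    }

    @Override
    public int compareTo(IntPairWritable o) {  // used when the framework sorts keys before the reduce
        int c = Integer.compare(first, o.first);
        return c != 0 ? c : Integer.compare(second, o.second);
    }

    // Demo helpers, not part of Hadoop's contract.
    static byte[] toBytes(IntPairWritable w) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        w.write(new DataOutputStream(bos));
        return bos.toByteArray();
    }

    static IntPairWritable fromBytes(byte[] bytes) throws IOException {
        IntPairWritable w = new IntPairWritable();
        w.readFields(new DataInputStream(new ByteArrayInputStream(bytes)));
        return w;
    }
}
```

The write/readFields pair is the whole serialization contract: no class descriptors or metadata are written, only the raw field bytes, which is why the framework needs the no-arg constructor to rebuild the object on the receiving side.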
September 20, 2018 at 4:40 pm · #5908 · DataFlair Team (Spectator)
These classes exist so objects can be handled the Hadoop way. For example, Hadoop uses Text instead of Java's String. The Text class is similar to a String, but it implements the Writable and WritableComparable interfaces (WritableComparable combines Writable with Comparable).
Both capabilities are necessary for MapReduce: the Comparable part is used when the framework sorts keys before the reduce phase, and Writable lets Hadoop serialize the object to disk or across the network. Hadoop does not use Java's built-in Serializable because Java serialization is too heavyweight for this purpose; Writable serializes Hadoop objects in a much more compact way.
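To make the "too heavy" claim concrete, the sketch below compares the number of bytes produced for a single int by Java's ObjectOutputStream versus a plain DataOutputStream.writeInt, which is how the Writable encoding works for an int. The class and method names here are invented for the demo.

```java
import java.io.ByteArrayOutputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.ObjectOutputStream;

// Compares serialized sizes: Java's built-in Serializable mechanism vs.
// the compact Writable-style encoding (a raw writeInt).
class SerializationSizeDemo {
    static int javaSerializedSize(int value) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        ObjectOutputStream oos = new ObjectOutputStream(bos);
        oos.writeObject(Integer.valueOf(value));  // stream header + class descriptor + value
        oos.flush();
        return bos.size();
    }

    static int writableStyleSize(int value) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        DataOutputStream dos = new DataOutputStream(bos);
        dos.writeInt(value);  // exactly 4 bytes of payload, no metadata
        dos.flush();
        return bos.size();
    }
}
```

Java serialization writes a stream header and a full class descriptor alongside the value, so even one int costs dozens of bytes; multiplied across billions of key-value pairs shuffled between mappers and reducers, that overhead is why Hadoop defines its own lightweight format.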