In MapReduce how to change the name of output file from part-r-00000

This topic has 1 reply, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.

Viewing 1 reply thread

Author

Posts
- September 20, 2018 at 11:44 am #4684
  
  DataFlair Team
  Spectator
  
  I want to change the name of output file of MapReduce Job, rather than part-*, I want to give a custom name. Actually I want to map the output file name with the hive table name.
- September 20, 2018 at 11:44 am #4685
  DataFlair Team
  Spectator
  By following 2 ways we can change the name of output file from part-r-00000:
  
  1. Using a Java class that derives from MultipleOutputFormat as the jobs output format allows control of the output file names.
```
// job.setOutputFormatClass(TextOutputFormat.class);
LazyOutputFormat.setOutputFormatClass(job, TextOutputFormat.class);
MultipleOutputs.addNamedOutput(job,“text”, TextOutputFormat.class,
Text.class, IntWritable.class);
```
  2. Using (this I have tested and working)
  job.getConfiguration().set(“mapreduce.output.basename”, “text”);
  part name will change and file will be created as text-r-00000
  
  For more details, please follow: MapReduce Tutorial
Author

Posts

Viewing 1 reply thread

You must be logged in to reply to this topic.