How much faster is Apache spark than Hadoop?

This topic has 2 replies, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.

Viewing 2 reply threads

Author

Posts
- September 20, 2018 at 9:23 pm #6366
  
  DataFlair Team
  Spectator
  
  How fast spark can process the data as compared to Hadoop?
- September 20, 2018 at 9:23 pm #6367
  
  DataFlair Team
  Spectator
  
  Apache Spark works faster when the data fits into memory, Spark processes data in memory which makes it faster in processing while MapReduce pushes data to disk after processing it. Usage of DAG helps to do a lot of optimization, it can optimize and do computations in a single stage, and it also avoids unwanted reducer tasks. Spark can cache partial or complete data in memory allowing to avoid a lot of disks I/O. Commercially Spark is said to 100x faster than Hadoop.
- September 20, 2018 at 9:23 pm #6368
  
  DataFlair Team
  Spectator
  
  Apache Spark works faster when the data resides in memory, Spark processes data in memory which makes it faster in processing. While MapReduce pushes data to disk after processing it. Commercially Spark is said to 100x faster in a memory based computation and 100 times faster in a memory based computation.
Author

Posts

Viewing 2 reply threads

You must be logged in to reply to this topic.