hadoop vs spark

Viewing 1 reply thread
  • Author
    • #5462
      DataFlair TeamDataFlair Team

      What are the differences between Apache Hadoop & Spark?
      Comparison between Spark vs Hadoop.

    • #5464
      DataFlair TeamDataFlair Team

      Hadoop is designed for batch processing.Batch processing is very efficient in the processing of high volume data.
      Hadoop MapReduce is batch oriented processing tool, it takes large dataset in the input, processes it and produces a result.
      Hadoop MapReduce adopted batch oriented model.Batch is essentially processing data at rest, taking a large amount of data
      at once and producing output.MapReduce process is slower than spark because due to produce a lot of intermediary data.

      Spark also supports batch processing system as well as stream processing.
      Spark streaming processes data streams in micro batches, Micro batches are an essentially collect and then process kind of
      computational model.Spark processes faster than map reduce because it caches input data in memory by RDD.

      Please find more details in below link


Viewing 1 reply thread
  • You must be logged in to reply to this topic.