Hadoop vs Spark

Viewing 1 reply thread
  • Author
    Posts
    • #5462
      DataFlair Team
      Spectator

      What are the differences between Apache Hadoop and Spark?
      Please compare Spark and Hadoop.

    • #5464
      DataFlair Team
      Spectator

      Hadoop is designed for batch processing, which is very efficient for high-volume data.
      Hadoop MapReduce is a batch-oriented processing tool: it takes a large dataset as input, processes it, and produces a result.
      Batch processing essentially means working on data at rest: taking a large amount of data
      at once and producing output. MapReduce is slower than Spark because it produces a lot of intermediate data, which it writes to disk between stages.
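To make the intermediate data concrete, here is a minimal single-process sketch of the map, shuffle, and reduce phases for a word count. It is plain Python, not the Hadoop API; the function names (`map_phase`, `shuffle_phase`, `reduce_phase`, `word_count`) are made up for illustration, and everything stays in memory here, whereas real MapReduce writes the map output to disk.

```python
from collections import defaultdict


def map_phase(lines):
    """Emit one (word, 1) pair per word; this list is the intermediate data."""
    return [(word.lower(), 1) for line in lines for word in line.split()]


def shuffle_phase(pairs):
    """Group the intermediate pairs by key, as the shuffle step would."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


def reduce_phase(groups):
    """Sum the values collected for each key."""
    return {key: sum(values) for key, values in groups.items()}


def word_count(lines):
    """Run the three phases and also report how many intermediate pairs were produced."""
    intermediate = map_phase(lines)
    counts = reduce_phase(shuffle_phase(intermediate))
    return counts, len(intermediate)


counts, n_intermediate = word_count(["spark and hadoop", "hadoop mapreduce"])
```

Even in this toy input, every word becomes its own intermediate pair before any reduction happens, which hints at why the intermediate volume grows quickly on large datasets.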

      Spark supports batch processing as well as stream processing.
      Spark Streaming processes data streams in micro-batches. Micro-batching is essentially a collect-then-process
      computational model. Spark is generally faster than MapReduce because it can cache data in memory using RDDs (Resilient Distributed Datasets).
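The collect-then-process idea can be sketched in a few lines of plain Python. This is only a toy model, not the Spark Streaming API: the helpers `micro_batches` and `process_stream` are hypothetical, and batches are cut by record count here, whereas Spark Streaming cuts them by time interval.

```python
from itertools import islice


def micro_batches(stream, batch_size):
    """Collect records from a stream into small lists of at most batch_size items."""
    iterator = iter(stream)
    while True:
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def process_stream(stream, batch_size):
    """Process each micro-batch as a tiny batch job (here: sum its records)."""
    return [sum(batch) for batch in micro_batches(stream, batch_size)]


# Seven records, collected three at a time, give three micro-batches.
totals = process_stream(range(1, 8), batch_size=3)
```

Each micro-batch is handled like an ordinary small batch job, which is why the same engine can serve both batch and streaming workloads.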

      For more details, see the link below:

      http://data-flair.training/blogs/spark-vs-flink-vs-hadoop-comparison
