Why do we need Hadoop?

Viewing 2 reply threads
  • Author
    Posts
    • #5425
      DataFlair TeamDataFlair Team
      Spectator

      Why Hadoop is needed?
      Why Hadoop is Important in Handling Big Data?
      Why Hadoop is used?

    • #5426
      DataFlair TeamDataFlair Team
      Spectator

      Apache Hadoop is a platform for data storage and processing.The data from social media sites like facebook , twitter and shopping sites etc has increased tremendously in the past few years. Our traditional system(RDBMS) has failed to store
      and process these data.

      Advantages of using Hadoop are-

      1) It stores both structured and unstructured data as it is.

      2) It is Fault Tolerant as failure of any node is recovered automatically.

      3) It process complex data easily and very fast.

      4) It works in distributed processing manner that means multiple task execution will
      be done parallelly at the same time.

      5) Hadoop offers a cost effective data storage solutions.

      6) Data is reliably stored on cluster of machines despite of machine failure.

    • #5427
      DataFlair TeamDataFlair Team
      Spectator

      Advantages of using Apache Hadoop are-

      1) Ability to store and process huge amounts data: The HDFS layer can store huge volume of data.

      2) Computing power- Hadoop’s distributed computing model processes data fast.

      3) Fault Tolerant If a node goes down, jobs are automatically redirected to other nodes to make sure the distributed computing does not fail(data availability).Data is also replicated so Fault tolerance is attained(replication).

      4) Low cost- The open-source framework is free and uses commodity hardware to store large quantities of data.

      5) Scalability- Cluster can be grown easily handle more data simply by adding nodes(horizontal scaling).

      6) Not bounded by single Schema.

      7) Reading data and Computing data in parallel makes it fast.

      8) Computation moves to data, making Hadoop efficient and fast compared to traditional approach(Data locality).

Viewing 2 reply threads
  • You must be logged in to reply to this topic.