What is Hadoop and Big Data?

Viewing 2 reply threads
  • Author
    Posts
    • #5329
      DataFlair TeamDataFlair Team
      Spectator

      What is Hadoop?
      What is Big Data?
      Why are they booming in the industry?

    • #5332
      DataFlair TeamDataFlair Team
      Spectator

      Big data is a phrase used to mean a massive volume of both structured and unstructured data that is so large it is difficult to store and process using traditional database and software techniques. Four dimensions of Bigdata as per IBM:
      Volume – Scale of data (Data size)
      Velocity – Speed of generation of data
      Variety – Different forms of data (structured, semi-structured, unstructured)
      Veracity – Uncertainty of data

      Hadoop is an open source data (big data) processing framework that supports storage as well as processing of large and complex datasets in a distributed computing environment.

      Core components of Hadoop are:

      1) HDFS -Hadoop Distributed File System – It is the most reliable storage system on the planet, which provides reliable, distributed, fault tolerant and scalable file system for data storage.

      2) Yarn – Yet Another Resource Negotiator, it is the resource management layer of Hadoop.

      3) MapReduce – Application layer, which provides distributed computation to process data across the servers.

      Big data has the potential to help companies to improve operations, make intelligent and faster decisions. Using Hadoop, Big data can be captured, stored, formatted, manipulated and analyzed in order to help organizations to derive useful business insights to increase revenues, get or retain customers and improve operations.

      Follow the link to learn more about Big data 
      Follow the link to learn more about Hadoop

    • #5334
      DataFlair TeamDataFlair Team
      Spectator

      1) Hadoop is an open source framework used for storing huge volume of data sets belonging to any format and processing those data sets at a rapid pace by means of distributed computing.

      2) Big data refers to huge sets of structured, semi structured or unstructured data that are mined by the organizations for the purpose of identifying new opportunities. That, in turn, leads to smarter business moves, more efficient operations, higher profits and happier customers.

      3) Big Data is booming due to following reasons:

      a) Big Data bring significant cost advantages when it comes to storing large amounts of data.

      b) Big Data offers the ability to gauge customer needs and satisfaction through analytics. It allows the organization to give customers what they want, thereby, improving their services.

      For more detail please refer: Big data 
      For more detail please refer: Hadoop

Viewing 2 reply threads
  • You must be logged in to reply to this topic.