Free Online Certification Courses – Learn Today. Lead Tomorrow. › Forums › Apache Hadoop › What is Hadoop and Big Data?
- This topic has 2 replies, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.
-
AuthorPosts
-
-
September 20, 2018 at 2:52 pm #5329DataFlair TeamSpectator
What is Hadoop?
What is Big Data?
Why are they booming in the industry? -
September 20, 2018 at 2:52 pm #5332DataFlair TeamSpectator
Big data is a phrase used to mean a massive volume of both structured and unstructured data that is so large it is difficult to store and process using traditional database and software techniques. Four dimensions of Bigdata as per IBM:
Volume – Scale of data (Data size)
Velocity – Speed of generation of data
Variety – Different forms of data (structured, semi-structured, unstructured)
Veracity – Uncertainty of dataHadoop is an open source data (big data) processing framework that supports storage as well as processing of large and complex datasets in a distributed computing environment.
Core components of Hadoop are:
1) HDFS -Hadoop Distributed File System – It is the most reliable storage system on the planet, which provides reliable, distributed, fault tolerant and scalable file system for data storage.
2) Yarn – Yet Another Resource Negotiator, it is the resource management layer of Hadoop.
3) MapReduce – Application layer, which provides distributed computation to process data across the servers.
Big data has the potential to help companies to improve operations, make intelligent and faster decisions. Using Hadoop, Big data can be captured, stored, formatted, manipulated and analyzed in order to help organizations to derive useful business insights to increase revenues, get or retain customers and improve operations.
Follow the link to learn more about Big data
Follow the link to learn more about Hadoop -
September 20, 2018 at 2:52 pm #5334DataFlair TeamSpectator
1) Hadoop is an open source framework used for storing huge volume of data sets belonging to any format and processing those data sets at a rapid pace by means of distributed computing.
2) Big data refers to huge sets of structured, semi structured or unstructured data that are mined by the organizations for the purpose of identifying new opportunities. That, in turn, leads to smarter business moves, more efficient operations, higher profits and happier customers.
3) Big Data is booming due to following reasons:
a) Big Data bring significant cost advantages when it comes to storing large amounts of data.
b) Big Data offers the ability to gauge customer needs and satisfaction through analytics. It allows the organization to give customers what they want, thereby, improving their services.
For more detail please refer: Big data
For more detail please refer: Hadoop
-
-
AuthorPosts
- You must be logged in to reply to this topic.