Hadoop
is most popular and powerful big data tool.
It provides
1-HDFS ->>(WOLRD MOST RELIABLE STORAGE LAYER)
2-MAPREDUCE->>(DISTRIBUTED PROCESSING)
3-YARN->>(RESOURCE MANAGEMENT LAYER)
There are some features of Hadoop listed below:
1-DEADLOCK->Hadoop will never go under Deadlock condition.
2-OPEN-SOURCE->It means that we can modify the source code upon our business requirement and it will open the gate for the researcher.
3-DISTRIBUTED PROCESSING->Data is stored in a distributed manner so that data is processed parallel on a cluster of nodes.
4-FAULT TOLERANCE->It can store 3 replicas by default on different datanode of the different rack so if any node goes down, data on that node will be recovered from other nodes.
5-RELIABILITY->Data is stored reliably on a cluster of the machine due to replication of data in the cluster.
6-SCALABILITY->It provide horizontal scalability so that new hardware can easly added to nodes.
7-HIGH AVAILABILITY->It means available in worst condition.it will never show a message that DATA IS NOT AVAILABLE.
8-ECONOMIC->it runs on commodity hardware so it is not expensive.
9-EASY TO USE->It will perform distributed processing by himself.
10->DATA LOCALITY->it run DATA LOCALITY principle which state that move computation to data instead of data to computation.
To learn more about features of Hadoop follow: Hadoop Features