What are the features of Hadoop HDFS?

This topic has 2 replies, 1 voice, and was last updated 5 years, 7 months ago by DataFlair Team.

Viewing 2 reply threads

Author

Posts
- September 20, 2018 at 5:45 pm #6328
  
  DataFlair Team
  Spectator
  
  What are the Characteristics of HDFS?
- September 20, 2018 at 5:45 pm #6330
  
  DataFlair Team
  Spectator
  
  1). HDFS uses GFS pattern for data storage and processing.
  2). It is designed to run on commodity hardware.
  3). It handles huge volumes of data.
  4). It is reliable, Fault-Tolerance and provides distributed computing.
  5). HDFS provides file permissions and authentication.
  6). The built-in servers of namenode and datanode help users to easily check the status of the cluster.
  
  For more detail follow HDFS Features
- September 20, 2018 at 5:45 pm #6331
  
  DataFlair Team
  Spectator
  
  1. HDFS provides a write-once-read-many access model.
  2.HDFS is built using the Java language making it portable across various platforms.
  3. Economic: The HDFS is deployed in low commodity hardware(cheap hardware).So the cost for the infrastruture will be less.
  4. Variety and Volume of Data: In HDFS we can store huge data tera or petabytes of data and also different kinds of data structured or unstructured data.
  5. Fault-Tolerance: HDFS is highly fault tolerant , in HDFS data is divided into blocks and multiple blocks of information is copied across the machines in the cluster. If any machine goes down we can get the data from other machines.
  6.Data Reliability: The data is stored reliably by creating copies of data across the nodes and so provides Fault Tolerance.
  7.Data Integrity: The data in the nodes are checked whether correct or not .HDFS constantly checks the integrity of data stored aganist its checksum.If the data is fault, then it sends to namenode and namenode deletes the fault data and creates aditional replica of data in other nodes.
  8.High Throughput: The data is processed parallel in multiple nodes so we have high throughput i.e,we can access the data from file system
  9. Data Locality: In traditional systems , the data is moved to the processing unit. But here the data is huge ,HDFS moves the processing unit to data which makes computation faster.
  10.scalability: We can scale the cluster as required on run. This feature helps to add more nodes on run without stooping the cluster. There are two ways to achieve it. Horizontal and vertical.
  
  For more detail follow HDFS Features
Author

Posts

Viewing 2 reply threads

You must be logged in to reply to this topic.

What are the features of Hadoop HDFS?

About DataFlair

Trending Data Science Courses

Free Big Data Courses

Trending Programming Courses

Trending Web Dev Courses

Trending Courses

Trending Python Courses

Trending Java Courses

Trending DSA Courses