

{"id":78,"date":"2016-06-04T07:48:26","date_gmt":"2016-06-04T07:48:26","guid":{"rendered":"http:\/\/data-flair.training\/blogs\/?p=78"},"modified":"2021-08-25T22:34:15","modified_gmt":"2021-08-25T17:04:15","slug":"apache-hadoop-hdfs-introduction-tutorial","status":"publish","type":"post","link":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/","title":{"rendered":"Apache Hadoop HDFS &#8211; An Introduction to HDFS"},"content":{"rendered":"<div class='__iawmlf-post-loop-links' style='display:none;' data-iawmlf-post-links='[{&quot;id&quot;:2324,&quot;href&quot;:&quot;https:\\\/\\\/hadoop.apache.org\\\/docs\\\/r1.2.1\\\/hdfs_design.html&quot;,&quot;archived_href&quot;:&quot;http:\\\/\\\/web-wp.archive.org\\\/web\\\/20251004005724\\\/https:\\\/\\\/hadoop.apache.org\\\/docs\\\/r1.2.1\\\/hdfs_design.html&quot;,&quot;redirect_href&quot;:&quot;&quot;,&quot;checks&quot;:[{&quot;date&quot;:&quot;2025-12-11 03:56:23&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2025-12-14 09:53:08&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2025-12-17 12:20:57&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2025-12-20 15:28:27&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2025-12-23 15:31:00&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2025-12-27 07:03:50&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2025-12-30 07:03:59&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-02 08:16:19&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-05 09:48:00&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-08 13:08:28&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-11 13:53:21&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-14 18:46:44&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-17 23:13:23&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-21 02:11:09&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-24 03:50:32&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-27 04:47:32&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-01-30 05:44:37&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-02 06:39:48&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-05 07:42:36&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-08 11:08:20&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-11 13:16:33&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-14 14:41:19&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-17 17:07:04&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-20 18:54:13&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-24 07:43:58&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-02-27 12:00:15&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-02 13:35:05&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-05 13:50:35&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-09 02:43:14&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-12 23:35:20&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-16 01:39:23&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-19 15:11:57&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-23 04:19:36&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-26 04:52:31&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-03-29 13:58:44&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-01 15:49:29&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-04 17:03:01&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-08 05:34:46&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-11 06:40:00&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-14 10:12:16&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-17 13:15:34&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-20 16:19:16&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-23 17:33:22&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-27 06:17:27&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-04-30 06:23:47&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-03 10:14:11&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-06 16:08:00&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-10 08:28:08&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-13 11:44:24&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-17 06:06:54&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-20 08:02:43&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-23 10:14:23&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-26 10:17:26&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-05-29 12:21:21&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-06-01 13:42:21&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-06-04 17:49:02&quot;,&quot;http_code&quot;:206},{&quot;date&quot;:&quot;2026-06-08 03:17:13&quot;,&quot;http_code&quot;:206}],&quot;broken&quot;:false,&quot;last_checked&quot;:{&quot;date&quot;:&quot;2026-06-08 03:17:13&quot;,&quot;http_code&quot;:206},&quot;process&quot;:&quot;done&quot;}]'><\/div>\n<h2>1. Objective<\/h2>\n<p>In this Hadoop\u00a0tutorial, we will discuss World&#8217;s most reliable storage system &#8211;<a href=\"http:\/\/data-flair.training\/blogs\/comprehensive-hdfs-guide-introduction-architecture-data-read-write-tutorial\/\"> HDFS (Hadoop Distributed File System)<\/a>. HDFS is Hadoop&#8217;s storage layer which provides <a href=\"http:\/\/data-flair.training\/blogs\/hadoop-high-availability-tutorial\/\">high availability<\/a>, reliability and fault tolerance. It is anticipated that world&#8217;s 75% of data will be stored in Hadoop HDFS by the end of 2017. This tutorial will provide a complete overview of what is HDFS? This introductory guide will cover basics of HDFS, HDFS introduction, HDFS nodes, HDFS daemons, etc.<\/p>\n<div id=\"attachment_41994\" style=\"width: 1210px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-41994\" class=\"size-full wp-image-41994\" src=\"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg\" alt=\"Apache Hadoop HDFS Tutorial\" width=\"1200\" height=\"628\" srcset=\"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg 1200w, https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01-150x79.jpg 150w, https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01-300x157.jpg 300w, https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01-768x402.jpg 768w, https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01-1024x536.jpg 1024w, https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01-520x272.jpg 520w\" sizes=\"auto, (max-width: 1200px) 100vw, 1200px\" \/><\/a><p id=\"caption-attachment-41994\" class=\"wp-caption-text\">Apache Hadoop HDFS Tutorial<\/p><\/div>\n<h2>2. What is Hadoop HDFS?<\/h2>\n<p>Apache Hadoop HDFS is a distributed file system which provides redundant storage space for storing files which are huge in sizes; files which are in the range of Terabytes and Petabytes. In HDFS data is stored reliably. Files are broken into blocks and distributed across nodes in a cluster. After that each block is replicated, means copies of blocks are created on different machines. Hence if a machine goes down or gets crashed, then also we can easily retrieve and access our data from different machines. By default, 3 copies of a file are created on different machines. Hence it is highly fault-tolerant. HDFS provides faster file <a href=\"http:\/\/data-flair.training\/blogs\/hadoop-hdfs-data-read-and-write-operations\/\">read and writes mechanism<\/a>, as data is stored in different nodes in a cluster. Hence the user can easily access the data from any machine in a cluster. Hence HDFS is highly used as a platform for storing huge volume and different varieties of data worldwide.<\/p>\n<p>Before working with HDFS you must have Hadoop installed and running, to <a href=\"http:\/\/data-flair.training\/blogs\/install-cloudera-hadoop-cdh5-ubuntu\/\">install and configure Hadoop follow this Installation Guide<\/a>.<\/p>\n<h2>3.\u00a0HDFS Nodes<\/h2>\n<p>HDFS has Master\/slave architecture. There are two nodes in HDFS: Master and Slaves. The master node maintains various data storage and processing management services in distributed <a href=\"http:\/\/data-flair.training\/blogs\/install-hadoop-1-x-on-multi-node-cluster\/\">Hadoop clusters<\/a>. The actual data in <a href=\"http:\/\/data-flair.training\/blogs\/top-hdfs-commands-tutorial\/\">HDFS<\/a> is stored in Slave nodes. Data is also processed on the slave nodes.<\/p>\n<h3>3.1. Master<\/h3>\n<p>Master is the centerpiece of HDFS. It stores the metadata of <a href=\"http:\/\/data-flair.training\/blogs\/interact-hadoop-hdfs-commands-perform-operations\/\">HDFS<\/a>. All the information related to files stored in HDFS gets stored in Master. It also gives information about where across the cluster the file data is kept. Master contains information about the details of the blocks and its location for all files present in HDFS. The idea of constructing the file from blocks comes with the help of this information to the master. Master is the most critical part of HDFS and if all the masters get crashed or down then the HDFS cluster is also considered down and becomes useless.<\/p>\n<h3>3.2. Slave<\/h3>\n<p>The actual files or the data of client is present on the slaves. The most important and useful functionality of slaves is to control storage attached to the nodes in which they run. As we know that, in HDFS files are broken down into smaller blocks and these blocks are distributed across nodes in the cluster. The slaves within the cluster manage these file blocks. And in order to perform all filesystem operations, it sends information to the Master about the blocks present. HDFS has more than one slaves, and the replicas of blocks are created across them.<\/p>\n<p>Learn the Internals of HDFS Data Read Operation, Follow this <a href=\"http:\/\/data-flair.training\/blogs\/data-read-operation-in-hdfs\/\">tutorial to understand How Data flows in HDFS while reading the file<\/a><\/p>\n<h2>4. HDFS Daemons<\/h2>\n<p>In Hadoop HDFS there are three daemons. All the daemons run on their own JVMs in the background to support required services.<\/p>\n<h3>4.1. NameNode<\/h3>\n<p>Namenode is the master daemon of HDFS which runs on all the masters. It manages the HDFS filesystem namespace. NameNode keeps the record of all the files present in the HDFS. NameNode also keeps the record of the changes created in file system namespace.<\/p>\n<h3>4.2. DataNode<\/h3>\n<p>Datanode is the slave daemon of HDFS which runs on all the slaves. The function of DataNode is to store data in the HDFS. It contains the actual data blocks. HDFS cluster usually has more than one DataNodes. Data is replicated across the other machines present in the HDFS cluster.<\/p>\n<h3>4.3. SecondaryNameNode<\/h3>\n<p>The job of SecondaryNameNode is to perform backup and record-keeping functions for the NameNode. Secondary Namenode periodically pulls the data from namenode, so if namemode goes down we can manually make secondary NN as Namenode. One important point, it is not a hot standby of namenode.<\/p>\n<h2>5. How Data gets Stored in HDFS<\/h2>\n<p>In Hadoop HDFS data files are divided into smaller chunks called blocks. Now, these blocks are then distributed across a group of machines which are known as slaves. Here slave machines create replica of these blocks and distribute across other machines in the cluster. Now individual slaves send reports to the master containing information about the files and blocks stored on them. When slaves receive instructions like add\/copy\/move\/delete, etc. from the master then slaves performs the particular operations on the file system. After this, the slave sends a report to the master regarding completion of the task. <a href=\"http:\/\/data-flair.training\/blogs\/data-write-pipeline-operation-hdfs\/\">Learn\u00a0Internals of HDFS Data Write Pipeline and File write execution flow<\/a><\/p>\n<h2>6. Blocks in HDFS<\/h2>\n<p>Blocks in HDFS is the segment of a file. These segments of files get stored on the nodes present in HDFS cluster and now the replicas of these blocks are created on the other nodes in the cluster. The data stored in HDFS is split by the framework. The default block size in<a href=\"http:\/\/data-flair.training\/blogs\/features-hadoop-hdfs-overview-beginners\/\"> HDFS<\/a> is 128 MB. We can increase the blocks size as per the requirements. These blocks are distributed across different machines. Now replicas of these blocks are created on different machines in the cluster. By default minimum, three copies of a block are created (which is configurable) on other machines. So if a machine goes down, then blocks stored on that machine can be accessed from other two machines.<\/p>\n<h2>7. Heartbeat Message<\/h2>\n<p>All the slaves send a message to the masters just like a heartbeat in every 3 seconds to inform that they are alive. If no heartbeat message is received by masters from any particular slave for more than 10 minutes, then it considers that slave has failed and now it is not working and hence it start creating a replication of blocks which were available on that slave. Now the slaves can talk to each other to rebalance data, by moving and copying the data to each other to keep the required replication. As the environment is distributed there should be some mechanism from which master will come to know the current status of all the slaves in the cluster. Hence all the slaves continuously send a small heartbeat message (signals) to master to tell &#8220;I am Alive&#8221;. if master found any machine dead it will not allocate any new work submitted by the client.<\/p>\n<p>To play with Hadoop HDFS using commands <a href=\"http:\/\/data-flair.training\/blogs\/most-used-hdfs-commands-examples\/\">follow HDFS commands Guide<\/a>. and to Learn what is Rack Awareness in Hadoop HDFS <a href=\"http:\/\/data-flair.training\/blogs\/rack-awareness-hadoop-hdfs\/\">follow this tutorial<\/a>.<\/p>\n<p><a href=\"https:\/\/hadoop.apache.org\/docs\/r1.2.1\/hdfs_design.html\">Reference<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>1. Objective In this Hadoop\u00a0tutorial, we will discuss World&#8217;s most reliable storage system &#8211; HDFS (Hadoop Distributed File System). HDFS is Hadoop&#8217;s storage layer which provides high availability, reliability and fault tolerance. It is&#46;&#46;&#46;<\/p>\n","protected":false},"author":7,"featured_media":41994,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[25],"tags":[1907,1971,3409,3416,3514,3722,5548,5550,5563,5577,5586,8126,15746],"class_list":["post-78","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-hdfs","tag-big-data","tag-big-data-training","tag-data-read","tag-data-science","tag-data-write","tag-define-hdfs","tag-hdfs","tag-hdfs-basics","tag-hdfs-dfs","tag-hdfs-introduction","tag-hdfs-overview","tag-learn","tag-what-is-hdfs"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v27.4 - https:\/\/yoast.com\/product\/yoast-seo-wordpress\/ -->\n<title>Apache Hadoop HDFS - An Introduction to HDFS - DataFlair<\/title>\n<meta name=\"description\" content=\"HDFS tutorial-what is Hadoop HDFS,HDFS introduction,How Data is Stored in Hadoop distributed file system,heartbeat in HDFS,HDFS Nodes,HDFS blocks,HDFS Daemon\" \/>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Apache Hadoop HDFS - An Introduction to HDFS - DataFlair\" \/>\n<meta property=\"og:description\" content=\"HDFS tutorial-what is Hadoop HDFS,HDFS introduction,How Data is Stored in Hadoop distributed file system,heartbeat in HDFS,HDFS Nodes,HDFS blocks,HDFS Daemon\" \/>\n<meta property=\"og:url\" content=\"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/\" \/>\n<meta property=\"og:site_name\" content=\"DataFlair\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/DataFlairWS\/\" \/>\n<meta property=\"article:published_time\" content=\"2016-06-04T07:48:26+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2021-08-25T17:04:15+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"1200\" \/>\n\t<meta property=\"og:image:height\" content=\"628\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"DataFlair Team\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@DataFlairWS\" \/>\n<meta name=\"twitter:site\" content=\"@DataFlairWS\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"DataFlair Team\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"6 minutes\" \/>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Apache Hadoop HDFS - An Introduction to HDFS - DataFlair","description":"HDFS tutorial-what is Hadoop HDFS,HDFS introduction,How Data is Stored in Hadoop distributed file system,heartbeat in HDFS,HDFS Nodes,HDFS blocks,HDFS Daemon","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/","og_locale":"en_US","og_type":"article","og_title":"Apache Hadoop HDFS - An Introduction to HDFS - DataFlair","og_description":"HDFS tutorial-what is Hadoop HDFS,HDFS introduction,How Data is Stored in Hadoop distributed file system,heartbeat in HDFS,HDFS Nodes,HDFS blocks,HDFS Daemon","og_url":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/","og_site_name":"DataFlair","article_publisher":"https:\/\/www.facebook.com\/DataFlairWS\/","article_published_time":"2016-06-04T07:48:26+00:00","article_modified_time":"2021-08-25T17:04:15+00:00","og_image":[{"width":1200,"height":628,"url":"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg","type":"image\/jpeg"}],"author":"DataFlair Team","twitter_card":"summary_large_image","twitter_creator":"@DataFlairWS","twitter_site":"@DataFlairWS","twitter_misc":{"Written by":"DataFlair Team","Est. reading time":"6 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"Article","@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#article","isPartOf":{"@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/"},"author":{"name":"DataFlair Team","@id":"https:\/\/data-flair.training\/blogs\/#\/schema\/person\/beb0cab24b7aa54423a3b50e669a9dcd"},"headline":"Apache Hadoop HDFS &#8211; An Introduction to HDFS","datePublished":"2016-06-04T07:48:26+00:00","dateModified":"2021-08-25T17:04:15+00:00","mainEntityOfPage":{"@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/"},"wordCount":1136,"commentCount":1,"publisher":{"@id":"https:\/\/data-flair.training\/blogs\/#organization"},"image":{"@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#primaryimage"},"thumbnailUrl":"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg","keywords":["big data","big data training","data read","data science","data write","Define HDFS","hdfs","HDFS Basics","HDFS DFS","HDFS Introduction","HDFS Overview","learn","What is HDFS"],"articleSection":["HDFS Tutorials"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#respond"]}]},{"@type":"WebPage","@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/","url":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/","name":"Apache Hadoop HDFS - An Introduction to HDFS - DataFlair","isPartOf":{"@id":"https:\/\/data-flair.training\/blogs\/#website"},"primaryImageOfPage":{"@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#primaryimage"},"image":{"@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#primaryimage"},"thumbnailUrl":"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg","datePublished":"2016-06-04T07:48:26+00:00","dateModified":"2021-08-25T17:04:15+00:00","description":"HDFS tutorial-what is Hadoop HDFS,HDFS introduction,How Data is Stored in Hadoop distributed file system,heartbeat in HDFS,HDFS Nodes,HDFS blocks,HDFS Daemon","breadcrumb":{"@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#primaryimage","url":"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg","contentUrl":"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/06\/Apache-Hadoop-HDFS-01.jpg","width":1200,"height":628,"caption":"Apache Hadoop HDFS Tutorial"},{"@type":"BreadcrumbList","@id":"https:\/\/data-flair.training\/blogs\/apache-hadoop-hdfs-introduction-tutorial\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Blog Home","item":"https:\/\/data-flair.training\/blogs\/"},{"@type":"ListItem","position":2,"name":"HDFS Tutorials","item":"https:\/\/data-flair.training\/blogs\/category\/hdfs\/"},{"@type":"ListItem","position":3,"name":"Apache Hadoop HDFS &#8211; An Introduction to HDFS"}]},{"@type":"WebSite","@id":"https:\/\/data-flair.training\/blogs\/#website","url":"https:\/\/data-flair.training\/blogs\/","name":"DataFlair","description":"Learn Today. Lead Tomorrow.","publisher":{"@id":"https:\/\/data-flair.training\/blogs\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/data-flair.training\/blogs\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/data-flair.training\/blogs\/#organization","name":"DataFlair","url":"https:\/\/data-flair.training\/blogs\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/data-flair.training\/blogs\/#\/schema\/logo\/image\/","url":"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/07\/Data-Flair.png","contentUrl":"https:\/\/data-flair.training\/blogs\/wp-content\/uploads\/sites\/2\/2016\/07\/Data-Flair.png","width":106,"height":48,"caption":"DataFlair"},"image":{"@id":"https:\/\/data-flair.training\/blogs\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/DataFlairWS\/","https:\/\/x.com\/DataFlairWS","https:\/\/www.linkedin.com\/company\/dataflair-web-services-pvt-ltd\/","https:\/\/www.youtube.com\/user\/DataFlairWS"]},{"@type":"Person","@id":"https:\/\/data-flair.training\/blogs\/#\/schema\/person\/beb0cab24b7aa54423a3b50e669a9dcd","name":"DataFlair Team","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/secure.gravatar.com\/avatar\/c322416204232f4dd97ef3901b0a499a5d34d7ba7fe333f4bfe53a907873d293?s=96&d=mm&r=g","url":"https:\/\/secure.gravatar.com\/avatar\/c322416204232f4dd97ef3901b0a499a5d34d7ba7fe333f4bfe53a907873d293?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/c322416204232f4dd97ef3901b0a499a5d34d7ba7fe333f4bfe53a907873d293?s=96&d=mm&r=g","caption":"DataFlair Team"},"description":"DataFlair Team specializes in creating clear, actionable content on programming, Java, Python, C++, DSA, AI, ML, data Science, Android, Flutter, MERN, Web Development, and technology. Backed by industry expertise, we make learning easy and career-oriented for beginners and pros alike.","url":"https:\/\/data-flair.training\/blogs\/author\/dfteam3\/"}]}},"amp_enabled":true,"_links":{"self":[{"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/posts\/78","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/comments?post=78"}],"version-history":[{"count":6,"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/posts\/78\/revisions"}],"predecessor-version":[{"id":41995,"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/posts\/78\/revisions\/41995"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/media\/41994"}],"wp:attachment":[{"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/media?parent=78"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/categories?post=78"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/data-flair.training\/blogs\/wp-json\/wp\/v2\/tags?post=78"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}