
Getting Started with Hadoop

Learn all about the ecosystem and get started with Hadoop today. Choose where to begin and learn at your own pace.

Exploring the Ecosystem

Let’s take a look at some facts about Hadoop and the entire ecosystem.

Hadoop 1.0 was released in December of 2011, although the project goes back much further: Doug Cutting and Mike Cafarella began the work after Google published its paper “The Google File System” in October of 2003. Hadoop is a collection of open-source software tools that lets a network of many computers work together on problems involving massive amounts of data and computation. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. The wider Hadoop ecosystem is a layered set of components that work together. Alongside core Hadoop, these include Avro, Ambari, Flume, HBase, HCatalog, HDFS, Hive, Impala, MapReduce, Pig, Sqoop, YARN, and ZooKeeper.
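To make the MapReduce idea concrete, here is a minimal sketch of the model as a word count in plain Python. It does not use Hadoop or any Hadoop API; the function names (map_phase, shuffle, reduce_phase, word_count) are made up for illustration, and everything runs in a single process rather than across a cluster.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all values that share the same key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reduce step: combine the values for one key into a single result."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence over a list of lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    grouped = shuffle(pairs)
    return dict(reduce_phase(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data big cluster", "data node"]))
```

In a real cluster the map and reduce steps would run in parallel on many machines, with the framework handling the shuffle between them; this sketch only shows the shape of the computation.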

Doug Cutting

Mike Cafarella