
Getting Started with Hadoop

Learn about Hadoop and its ecosystem components and start your career in Hadoop today. Choose where to begin and learn at your own pace.

Exploring the Ecosystem

Let’s take a look at some interesting facts about Hadoop and its ecosystem.

Hadoop 1.0 was released in December 2011, although the project's roots go back to October 2003, when Google published the paper “The Google File System”, which inspired Doug Cutting and Mike Cafarella to build it. Hadoop is a collection of open-source software tools that lets a network of many computers work together on problems involving massive amounts of data and computation. It provides a software framework for distributed storage and for processing big data with the MapReduce programming model. The Hadoop ecosystem is made up of components that work together. Alongside Hadoop's core modules (HDFS, MapReduce, and YARN), these include Avro, Ambari, Flume, HBase, HCatalog, Hive, Impala, Pig, Sqoop, and ZooKeeper.
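To make the MapReduce idea mentioned above more concrete, here is a minimal word-count sketch in plain Python. It only imitates the map, shuffle, and reduce steps inside a single process; it does not use Hadoop or any Hadoop API, and the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical names chosen for illustration.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle: group all emitted values under their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce: combine all values for one key into a single count."""
    return key, sum(values)


def word_count(lines):
    """Run the three phases in order over an iterable of text lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    # Assumed sample input; on a real cluster each line could live on a different node.
    sample = ["big data big cluster", "data node"]
    print(word_count(sample))
```

On a real cluster the map and reduce steps would run in parallel on many machines and the shuffle would move data across the network; the sketch keeps everything in memory so the flow of keys and values is easy to follow.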


Doug Cutting


Mike Cafarella