Differentiate HDFS and HBase?

Viewing 2 reply threads
  • Author
    Posts
    • #4582
      DataFlair TeamDataFlair Team
      Spectator

      What is the difference between HDFS and HBase?
      Comparision between HDFS and HBase?

    • #4583
      DataFlair TeamDataFlair Team
      Spectator

      HBase is a non-relational database that can run on top of Hadoop and provides you random data access capabilities. HDFS, by itself has no support for reads/writes at random location.

      Another primary difference would be the way data is stored in the two. HBase stores data as key/value pairs as in a column database (something similar to Cassandra DB) while in HDFS data stored as flat files.

      HBase is an extension for the Hadoop environment that allows you to quickly read/write data.

    • #4603
      DataFlair TeamDataFlair Team
      Spectator

      Let’s compare both on the basis of some features:

      HBase vs HDFS

      a. Built on

      HBase
      It is built on top of the HDFS.

      HDFS
      For storing large files, it is suitable.

      b. lookups

      HBase
      Especially, for larger tables, it offers fast lookups.

      HDFS
      But it does not offer fast lookups.

      C. Latency

      HBase
      HBase offers low latency access.

      HDFS
      Whereas, HDFS provides high latency batch processing; yet no support for batch processing.

Viewing 2 reply threads
  • You must be logged in to reply to this topic.