How can Flume be used with HBase?

Viewing 1 reply thread
  • Author
    Posts
    • #4647
      DataFlair TeamDataFlair Team
      Spectator

      How can Flume be used with HBase in Hadoop?

    • #4723
      DataFlair TeamDataFlair Team
      Spectator

      By using one of the two HBase sinks, we can use Apache Flume with HBase –

      – HBaseSink
      – AsyncHBaseSink

      HBaseSink Working –

      Basically, a Flume Event is converted into HBase Increments or Puts, in HBaseSink. Moreover, HBaseEventSerializer implemented by the Serializer which is further instantiated when the sink starts. Also, in the serializer, sink calls the initialize method which further translates the Flume Event into HBase increments and also puts to be sent to HBase cluster, for each event.

      AsyncHBaseSink Working-

      Generally, AsyncHBaseEventSerializer implemented by the AsyncHBaseSink. Moreover, only once when the sink starts, it calls the initialize method. And, as same as HBase sink, here also Sink invokes the setEvent method and makes calls to the getActions and getIncrements methods. Further, the cleanUp method is called by the serializer, when the sink stops.

Viewing 1 reply thread
  • You must be logged in to reply to this topic.