# R and Hadoop Integration | R Integration with Hadoop

## 1. R and Hadoop Integration

In this tutorial, we will learn about **R and Hadoop Integration**: when to use the R and Hadoop combination, and how R integration with Hadoop is implemented. I recommend going through Hadoop and R programming first. So let's start with integrating R and Hadoop for big data analysis.

## 2. Introduction to R With Hadoop Integration

### a. Introduction to R Programming Language

### b. Introduction to Hadoop

## 3. R and Hadoop Integration Purpose

- Use Hadoop to execute R code

- Use R to access data stored in Hadoop

There are four methods for integrating R with Hadoop:

### a. RHadoop

RHadoop is a collection of 3 packages. Here, we will discuss the functionality of each package.

#### i. The rmr package

#### ii. The rhbase package

#### iii. The rhdfs package

### b. Hadoop Streaming

Hadoop Streaming is a Hadoop utility that lets you write MapReduce programs in a language other than Java, such as R, which makes it extremely user-friendly. The HadoopStreaming R script, available as part of an R package on CRAN, intends to make R more accessible to Hadoop streaming applications. Java is the native language for MapReduce, but it does not always suit today's need for high-speed data analysis, where we want to write the mapping and reducing steps quickly. Hence, Hadoop streaming is in demand, since we can also write the code in Python, Perl, or even Ruby.
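To make the streaming idea concrete, here is a minimal word-count sketch in Python, one of the languages mentioned above. It assumes the usual streaming convention of tab-separated `key<TAB>value` text records and simulates the map, sort, and reduce phases in a single process. The function names `mapper`, `reducer`, and `run_locally` are hypothetical and chosen for illustration only.

```python
from itertools import groupby


def mapper(lines):
    """Emit one 'word<TAB>1' record per word, as a streaming mapper would print."""
    for line in lines:
        for word in line.split():
            yield f"{word.lower()}\t1"


def reducer(sorted_records):
    """Sum the counts for each key; the input must already be sorted by key."""
    pairs = (record.split("\t", 1) for record in sorted_records)
    for word, group in groupby(pairs, key=lambda kv: kv[0]):
        yield f"{word}\t{sum(int(count) for _, count in group)}"


def run_locally(lines):
    """Simulate map -> sort -> reduce in one process (no cluster involved)."""
    return list(reducer(sorted(mapper(lines))))


# Hypothetical sample input, used only to exercise the sketch.
sample = ["R and Hadoop", "Hadoop and R and Python"]
for record in run_locally(sample):
    print(record)
```

On a real cluster, the mapper and reducer would typically live in separate scripts that read standard input and write standard output, and would be passed to the streaming jar through its `-mapper` and `-reducer` options. An R script can follow the same contract.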

### c. RHIPE

RHIPE stands for R and Hadoop Integrated Programming Environment. It is an integrated programming environment developed for the Divide and Recombine (D&R) approach to analyzing large amounts of data.
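The Divide and Recombine idea itself can be sketched without any RHIPE code: divide the data into subsets, apply an analysis to each subset independently, then recombine the per-subset outputs. The Python sketch below is not RHIPE's API. The helper names `divide`, `analyze`, and `recombine` are hypothetical, and the "analysis" is just a mean, chosen because the recombined value can be checked against the mean of the whole data set.

```python
from statistics import mean


def divide(values, n_subsets):
    """Split the data into n_subsets chunks by taking every n-th element."""
    return [values[i::n_subsets] for i in range(n_subsets)]


def analyze(subset):
    """Per-subset analysis: return the subset size and its mean."""
    return len(subset), mean(subset)


def recombine(results):
    """Combine per-subset means into an overall mean, weighted by subset size."""
    total = sum(n for n, _ in results)
    return sum(n * m for n, m in results) / total


# Hypothetical data; each subset could be processed on a different node.
data = [2, 4, 6, 8, 10, 12, 14]
subsets = divide(data, 3)
overall = recombine([analyze(s) for s in subsets])
print(overall)  # equals the mean of the full data set
```

Weighting by subset size matters here: the subsets have unequal lengths, so a plain average of the per-subset means would not in general match the overall mean.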

### d. ORCH

ORCH is the Oracle R Connector for Hadoop. It can be used to work with big data on an Oracle appliance, and also on a non-Oracle framework like Hadoop.


## 5. Conclusion: R Integration with Hadoop


