Apache Hive Features | Limitations of Hive

1. Objective – Hive Features and Limitations

As we know, we use Hive to process structured data in Hadoop. Apache Hive has several useful features, but it also has several limitations. So, in this Hive tutorial, we will discuss both the features and the limitations of Hive. But before that, we will briefly introduce Hive.

2. What is Apache Hive?

Basically, Hive is the tool we use to process structured data in Hadoop. It is a data warehouse infrastructure that resides on top of Hadoop to summarize Big Data, and it makes querying and analyzing easy.
Hive was initially developed by Facebook. Later, the Apache Software Foundation took it up and developed it further as open source under the name Apache Hive. Many different companies use it; for example, Amazon uses it in Amazon Elastic MapReduce.

3. Apache Hive Features and Limitations

a. Hive Features

Some of the main features of Hive are discussed below:
i. Framework
Apache Hive is built on top of Hadoop and works with data stored in the Hadoop Distributed File System (HDFS).
ii. Large datasets
It helps to query large datasets residing in distributed storage.
iii. Warehouse
Hive is a distributed data warehouse.
iv. Language
Hive queries data using a SQL-like language called HiveQL (HQL).
v. Declarative language
HiveQL is a declarative language like SQL.
vi. Table structure
Table structures are similar to tables in a relational database.
vii. Multi-user
Multiple users can simultaneously query the data using HiveQL.
viii. Data Analysis
To perform more detailed data analysis, Hive allows plugging custom MapReduce scripts into queries.
ix. ETL support
It makes extract/transform/load (ETL) of data easy.
x. Data Formats
Hive imposes structure on a variety of data formats.
xi. Storage
Hive can access files stored in HDFS, as well as data in other storage systems such as Apache HBase.
xii. Format conversion
It allows converting data from one format to another within Hive.
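
To make a few of these features concrete, here is a minimal HiveQL sketch. It assumes tab-separated log files already sitting in HDFS. The table names (page_views, page_views_orc), the columns, and the path /data/page_views are hypothetical and chosen only for illustration.

```sql
-- Hypothetical table over tab-separated files that already exist in HDFS.
-- EXTERNAL means Hive only records the structure; the files stay where they are.
CREATE EXTERNAL TABLE page_views (
  user_id   BIGINT,
  url       STRING,
  view_time TIMESTAMP
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t'
STORED AS TEXTFILE
LOCATION '/data/page_views';   -- assumed path, adjust to your cluster

-- Declarative, SQL-like query: we say what we want, Hive plans the jobs.
SELECT url, COUNT(*) AS views
FROM page_views
GROUP BY url
ORDER BY views DESC
LIMIT 10;

-- Format conversion inside Hive: copy the text data into an ORC-backed table.
CREATE TABLE page_views_orc
STORED AS ORC
AS
SELECT user_id, url, view_time
FROM page_views;
```

The first statement shows Hive putting a table structure on plain files, the second shows a typical analytical query, and the third shows one common way to convert between storage formats, using CREATE TABLE ... AS SELECT.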

b. Limitations of Hive

i. OLTP Processing issues
Hive is not designed for online transaction processing (OLTP). However, we can use it for online analytical processing (OLAP).
ii. No Updates
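By default, Hive tables do not support row-level updates and deletes; from Hive 0.14 onward, these are available only on transactional (ACID) tables. However, Hive does support overwriting or appending data.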
iii. Subqueries
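Basically, subquery support in Hive is limited. Subqueries in the FROM clause are supported, while subqueries in the WHERE clause (IN, NOT IN, EXISTS, NOT EXISTS) are available only from Hive 0.13 onward and come with restrictions.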
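
The following sketch illustrates the append and overwrite behaviour mentioned under "No Updates", along with one subquery form that Hive accepts. The tables sales and sales_staging and their columns are hypothetical; treat this as an illustration under those assumptions, not a recipe for any particular cluster.

```sql
-- Hypothetical tables: sales (target) and sales_staging (newly arrived rows).

-- Append: the rows already in sales are kept, the selected rows are added.
INSERT INTO TABLE sales
SELECT order_id, customer_id, amount
FROM sales_staging;

-- Overwrite: the current contents of sales are replaced by the selected rows.
INSERT OVERWRITE TABLE sales
SELECT order_id, customer_id, amount
FROM sales_staging;

-- Subquery in the FROM clause; Hive requires it to have an alias (t here).
SELECT t.customer_id, t.total_amount
FROM (
  SELECT customer_id, SUM(amount) AS total_amount
  FROM sales
  GROUP BY customer_id
) t
WHERE t.total_amount > 1000;
```

Because a plain table cannot be updated row by row, a common workaround is to recompute the full result and write it back with INSERT OVERWRITE, often one partition at a time.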

So, this was all about the features and limitations of Hive. We hope you liked our explanation.

4. Conclusion

Hence, we have seen the main features and limitations of Hive. Still, if you have any query, feel free to ask in the comment section.

Responses

  1. Ved Prakash says:

    Please clarify the meaning of ‘HIVE does support overwriting or apprehending data’

    • Data Flair says:

      Hi Ved Prakash,
      Thank you for asking. Here is the explanation:
      Hive supports overwriting and appending. Here “apprehending” means appending, that is, inserting incremental data into the target through queries. For example, an INSERT INTO query appends data to the table or partition, keeping the existing data intact. The OVERWRITE keyword, by contrast, means that the contents of the target table (or partition) are deleted and replaced by the new files. So an INSERT OVERWRITE query overwrites any existing data in the table or partition.
      Hope this explanation helps you.
