How to handle a stuck process and improve performance?


    • #4813
      DataFlair Team
      Spectator

      If we are loading data into Hive and the process gets stuck or takes a very long time, how can we debug it and improve performance?

    • #4815
      DataFlair Team
      Spectator

      Several Hive optimization techniques can help when a process is stuck or taking a very long time. One of them is choosing a suitable file format:

      Usage of Suitable File Format in Hive

      ORC file format: choosing a file format that suits the data can drastically improve query performance, and ORC is generally a strong choice for this. ORC stands for Optimized Row Columnar; it stores data in a more optimized way than the other file formats.

      This format can reduce the size of the original data by up to 75%, so data processing also becomes faster. ORC performs better than the Text, Sequence and RC file formats. It stores rows of data in groups called stripes, along with a file footer. In short, the ORC format improves performance in Hive processing.
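
      As a rough illustration of switching a table to ORC, here is a minimal Python sketch that only builds the HiveQL statements as strings; it does not connect to Hive. The helper name orc_conversion_statements and the table names sales_text and sales_orc are hypothetical, and the sketch assumes the existing table can simply be copied with a CREATE TABLE ... AS SELECT.

```python
def orc_conversion_statements(source_table, target_table, compression="ZLIB"):
    """Return HiveQL statements that copy a table into ORC format.

    The table names are hypothetical placeholders supplied by the caller.
    """
    # Codecs accepted by the "orc.compress" table property.
    allowed = {"NONE", "ZLIB", "SNAPPY"}
    codec = compression.upper()
    if codec not in allowed:
        raise ValueError(f"unsupported ORC codec: {compression}")

    # CTAS: create the ORC table and fill it from the existing table.
    create = (
        f"CREATE TABLE {target_table} "
        f"STORED AS ORC "
        f'TBLPROPERTIES ("orc.compress"="{codec}") '
        f"AS SELECT * FROM {source_table}"
    )
    # Inspect the new table's metadata to confirm the storage format.
    verify = f"DESCRIBE FORMATTED {target_table}"
    return [create, verify]


if __name__ == "__main__":
    for stmt in orc_conversion_statements("sales_text", "sales_orc"):
        print(stmt + ";")
```

      The printed statements can be pasted into the Hive shell or Beeline. Queries would then be pointed at the ORC copy; whether the size reduction approaches the 75% figure depends on the data.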

      There are many more Hive optimization techniques that improve performance; to learn them all, follow the link: 7 Best Hive Optimization Techniques – Hive Performance
