Post 52 | HDPCD | The conclusion

Hi everyone. Finally, we have reached the end of this tutorial series. It’s been so long. We started this journey together on January 15th, 2017, and, 276 days later this beautiful journey is coming to an end. But, we do not need to worry, because, I am working on something new and would love toContinue reading “Post 52 | HDPCD | The conclusion”

Post 50 | HDPCD | Order Hive query output across multiple reducers

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to run a subquery within a Hive query. Let us begin, then. The following infographics show the step-by-step process of performing this operation. FromContinue reading “Post 50 | HDPCD | Order Hive query output across multiple reducers”

Post 48 | HDPCD | Printing the execution plan of a Hive query

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to print the execution plan of a Hive query. Let us begin, then. This is one of the simplest tutorials in this certification series. InContinue reading “Post 48 | HDPCD | Printing the execution plan of a Hive query”

Post 47 | HDPCD | Run a Hive query using Vectorization

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to run a Hive Query using TeZ execution engine. In this tutorial, we are going to see how to run a Hive Query using Vectorization. Let us begin, then. Before starting off with the objective of this tutorial, letContinue reading “Post 47 | HDPCD | Run a Hive query using Vectorization”

Post 42 | HDPCD | Update a row in a Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to load the compressed data into a Hive table. In this tutorial, we are going to see how to update a row in the Hive table. It is quite interesting to see that Hive supports ACIDContinue reading “Post 42 | HDPCD | Update a row in a Hive table”

Post 39 | HDPCD | Load data into a Hive table from an HDFS directory

Hello, everyone. Thanks for returning for the next tutorial in the HDPCD certification series. In the last tutorial, we saw how to load data into a Hive table from a local directory. In this tutorial, we are going to see how to load the data from the local Directory into the Hive table. Let us begin then.Continue reading “Post 39 | HDPCD | Load data into a Hive table from an HDFS directory”

Post 37 | HDPCD | Specifying delimiter of a Hive table

Hello, everyone. Thanks for coming back for one more tutorial in this HDPCD certification series. In the last tutorial, we saw how to specify the storage format of a Hive table. In this tutorial, we are going to see how to specify the delimiter of a Hive table. We are going to follow the processContinue reading “Post 37 | HDPCD | Specifying delimiter of a Hive table”

Post 33 | HDPCD | Define a Table from a SELECT Query

Hello everyone and welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to define a BUCKETED hive table. In this tutorial, we are going to see how to create a Hive table from a SELECT query. Let us begin then. We are going to follow the belowContinue reading “Post 33 | HDPCD | Define a Table from a SELECT Query”

Post 32 | HDPCD | Defining a Bucketed Hive Table

Hello everyone to the next tutorial in the HDPCD certification series. In the last tutorial, we saw how to create a Partitioned Hive Table. In this tutorial, we are going to see how to create a Bucketed Hive table. The process is depicted in the following infographics. As you can see from the above picture,Continue reading “Post 32 | HDPCD | Defining a Bucketed Hive Table”

Post 31 | HDPCD | Defining a Partitioned Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to create the hive external table. In this tutorial, we are going to see how to create a partitioned Hive table. For doing this, we are going to follow the following process. As you can seeContinue reading “Post 31 | HDPCD | Defining a Partitioned Hive table”