Post 52 | HDPCD | The conclusion

Hi everyone. Finally, we have reached the end of this tutorial series. It's been so long. We started this journey together on January 15th, 2017, and, 276 days later this beautiful journey is coming to an end. But, we do not need to worry, because, I am working on something new and would love to … Continue reading Post 52 | HDPCD | The conclusion

Post 51 | HDPCD | Set Hadoop or Hive Configuration property

Hello, everyone. Welcome to the last technical tutorial in the HDPCD certification series. It's funny! This beautiful journey is coming to an end. In the last tutorial, we saw how to sort the output of a Hive query across multiple reducers. In this tutorial, we are going to see how to set a Hadoop or Hive configuration … Continue reading Post 51 | HDPCD | Set Hadoop or Hive Configuration property

Post 50 | HDPCD | Order Hive query output across multiple reducers

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to run a subquery within a Hive query. Let us begin, then. The following infographics show the step-by-step process of performing this operation. From … Continue reading Post 50 | HDPCD | Order Hive query output across multiple reducers

Post 48 | HDPCD | Printing the execution plan of a Hive query

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to print the execution plan of a Hive query. Let us begin, then. This is one of the simplest tutorials in this certification series. In … Continue reading Post 48 | HDPCD | Printing the execution plan of a Hive query

Post 47 | HDPCD | Run a Hive query using Vectorization

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to run a Hive Query using TeZ execution engine. In this tutorial, we are going to see how to run a Hive Query using Vectorization. Let us begin, then. Before starting off with the objective of this tutorial, let … Continue reading Post 47 | HDPCD | Run a Hive query using Vectorization

Post 37 | HDPCD | Specifying delimiter of a Hive table

Hello, everyone. Thanks for coming back for one more tutorial in this HDPCD certification series. In the last tutorial, we saw how to specify the storage format of a Hive table. In this tutorial, we are going to see how to specify the delimiter of a Hive table. We are going to follow the process … Continue reading Post 37 | HDPCD | Specifying delimiter of a Hive table

Post 34 | HDPCD | Defining Hive Table using an ORC File Format

Hi, everyone. Thanks for joining me today for this tutorial. In the last tutorial, we saw how to create a hive table using the SELECT query. In this tutorial, we are going to see how to create a hive table which stores the data in the ORC File Format. The process of creating this table … Continue reading Post 34 | HDPCD | Defining Hive Table using an ORC File Format

Post 33 | HDPCD | Define a Table from a SELECT Query

Hello everyone and welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to define a BUCKETED hive table. In this tutorial, we are going to see how to create a Hive table from a SELECT query. Let us begin then. We are going to follow the below … Continue reading Post 33 | HDPCD | Define a Table from a SELECT Query

Post 32 | HDPCD | Defining a Bucketed Hive Table

Hello everyone to the next tutorial in the HDPCD certification series. In the last tutorial, we saw how to create a Partitioned Hive Table. In this tutorial, we are going to see how to create a Bucketed Hive table. The process is depicted in the following infographics. As you can see from the above picture, … Continue reading Post 32 | HDPCD | Defining a Bucketed Hive Table

Post 31 | HDPCD | Defining a Partitioned Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to create the hive external table. In this tutorial, we are going to see how to create a partitioned Hive table. For doing this, we are going to follow the following process. As you can see … Continue reading Post 31 | HDPCD | Defining a Partitioned Hive table