Post 52 | HDPCD | The conclusion

Hi everyone. Finally, we have reached the end of this tutorial series. It’s been so long. We started this journey together on January 15th, 2017, and 276 days later, this beautiful journey is coming to an end. But we do not need to worry, because I am working on something new and would love to… Continue reading “Post 52 | HDPCD | The conclusion”

Post 48 | HDPCD | Printing the execution plan of a Hive query

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to print the execution plan of a Hive query. Let us begin, then. This is one of the simplest tutorials in this certification series. In… Continue reading “Post 48 | HDPCD | Printing the execution plan of a Hive query”

Post 8 | HDPCD | Configure Flume Memory Channel

In the last tutorial, we saw how to start the Flume agent. This tutorial is an extension of the previous one, so please refer to it before getting started. The last tutorial enabled us to start the Flume agent, after which we can send the messages that we want over Flume… Continue reading “Post 8 | HDPCD | Configure Flume Memory Channel”

Post 7 | HDPCD | Starting Flume Agent

Hello everyone, I hope you are finding the tutorials useful. In the previous post, we performed the Sqoop Export operation. In this tutorial, we are going to start the Flume agent. Flume is one of the projects in the Apache ecosystem. Apache Flume is a reliable and distributed service for moving a large amount of log… Continue reading “Post 7 | HDPCD | Starting Flume Agent”