Post 52 | HDPCD | The conclusion

Hi everyone. Finally, we have reached the end of this tutorial series. It's been so long. We started this journey together on January 15th, 2017, and 276 days later, this beautiful journey is coming to an end. But we do not need to worry, because I am working on something new and would love to …

Post 30 | HDPCD | Define a Hive External Table

Hello, everyone! Welcome to the third tutorial in the Data Analysis section of the HDPCD certification. In the last tutorial, we saw how to create a Hive-managed (internal) table. In this tutorial, we are going to create a Hive external table. So, let us start with the process. The following infographics show the process …

Set Hive in Local / Auto Mode

Setting Hive to "AUTO/LOCAL MODE" comes in very handy when you are dealing with a small amount of data. How does this work? Well, suppose you are running a complex Hive query; you know that it will trigger a MapReduce job in the background and give you the output. This approach works well if the data size is …
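
As a rough sketch of the idea in this teaser, the session settings below ask Hive to decide on its own whether a query is small enough to run locally instead of launching a MapReduce job on the cluster. The property names are standard Hive configuration properties; the threshold values and the table name small_table are only illustrative assumptions and do not come from the post itself.

```sql
-- Let Hive choose local mode automatically for small inputs.
SET hive.exec.mode.local.auto=true;

-- Illustrative thresholds (assumed values, tune for your cluster):
-- total input size in bytes below which local mode is considered
SET hive.exec.mode.local.auto.inputbytes.max=134217728;
-- maximum number of input files for local mode to be considered
SET hive.exec.mode.local.auto.input.files.max=4;

-- Hypothetical query against a small table; with the settings above,
-- Hive may run it in local mode rather than on the cluster.
SELECT COUNT(*) FROM small_table;
```

If the input exceeds either threshold, Hive should fall back to running the job on the cluster as usual, so the settings are generally safe to leave on for mixed workloads.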