Hi everyone. Finally, we have reached the end of this tutorial series. It's been so long. We started this journey together on January 15th, 2017, and, 276 days later this beautiful journey is coming to an end. But, we do not need to worry, because, I am working on something new and would love to … Continue reading Post 52 | HDPCD | The conclusion
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to print the execution plan of a Hive query. Let us begin, then. This is one of the simplest tutorials in this certification series. In … Continue reading Post 48 | HDPCD | Printing the execution plan of a Hive query
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to update a row in a Hive table. In this tutorial, we are going to see how to delete a row in the Hive table. It is quite interesting to see that Hive supports ACID operations … Continue reading Post 43 | HDPCD | Delete a row in a Hive table
Hello, everyone! Welcome to the third tutorial in the Data Analysis section of the HDPCD certification. In the last tutorial, we saw how to create the hive-managed or internal table. In this tutorial, we are going to create the hive external table. So, let us start with the process. The following infographics show the process … Continue reading Post 30 | HDPCD | Define a Hive External Table
Hello everyone, welcome back to another post related to Hadoop Ecosystem. Lot of people have approached me regarding the prerequisites required before learning Hadoop, so this is for those poeple who are beginners and want to learn Hadoop Ecosystem. I consider following three components as the prerequisites for learning Hadoop. Linux File System and Commands … Continue reading Prerequisites for Hadoop – Part 1
Hi guys, Following code will enable us to read Microsoft Word Document file using JAVA API. https://gist.github.com/milindjagre/8495bd3ec78d9ad172d31f5eebf0689b Following is the pom.xml contents https://gist.github.com/milindjagre/7e3c444e55a87d00fdfc9fe053bbc0cf Following is the word file contents Following is the output of code. Thanks for having a read. Do comment below for your queries.