Post 31 | HDPCD | Defining a Partitioned Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to create the hive external table. In this tutorial, we are going to see how to create a partitioned Hive table. For doing this, we are going to follow the following process. As you can seeContinue reading “Post 31 | HDPCD | Defining a Partitioned Hive table”

Post 25 | HDPCD | Register a Jar file of UDF in Apache Pig

Hello, everyone.¬†Thanks for coming back again to continue with this certification series. In the last tutorial, we saw how to run any pig script with TEZ as the execution mode. In this tutorial, we are going to see how to register a JAR file to use the User Defined Function written and packages inside it.Continue reading “Post 25 | HDPCD | Register a Jar file of UDF in Apache Pig”

Post 24 | HDPCD | Run a Pig job using TEZ

Hey, everyone. Thank you for giving me company on this beautiful journey of HDPCD certification. We are almost done with the Data Transformation section of the certification and are only left with Data Analysis section using Apache Hive. The section of Data Analysis, in my opinion, is easier than this section so you can sayContinue reading “Post 24 | HDPCD | Run a Pig job using TEZ”

Post 2 | Machine Learning | Installations – R and Python

Hello, everyone, we are going to start off learning the concepts of Machine Learning. If you are following my blog posts on Hadoop and Big Data Analytics, then you will come to know I do give more importance on performing the hands-on exercises. Same is going to be the case for these tutorials. Here, weContinue reading “Post 2 | Machine Learning | Installations – R and Python”

Post 23 | HDPCD | Perform a REPLICATED JOIN using Apache Pig

Hey everyone, thank you once again for keep on coming back to perform these tutorials. In the last tutorial, we saw how to perform the simple JOIN Operation and in this tutorial, we are going to perform the REPLICATED JOIN Operation. ¬†The process is similar and there is a difference only at one place, soContinue reading “Post 23 | HDPCD | Perform a REPLICATED JOIN using Apache Pig”

HDPCD Certification – Post 1

In this post series, I am going to talk about the Hortonworks Data Platform Certified Developer Certification, also known as HDPCD. We’ll kick-off the proceedings with the introduction to this certification exam and the things need to be covered in order to earn a verified digital badge, proving your certification. Therefore, few facts about thisContinue reading “HDPCD Certification – Post 1”