Post 45 | HDPCD | Join two Hive tables

  Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to insert a new row into a Hive table. In this tutorial, we are going to see how to join two Hive tables. Let us begin, then. The above info-graphics show the step by step processContinue reading “Post 45 | HDPCD | Join two Hive tables”

Post 44 | HDPCD | Insert a row in the Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to delete a row in a Hive table. In this tutorial, we are going to see how to insert a row in the Hive table. It is quite interesting to see that Hive supports ACID operationsContinue reading “Post 44 | HDPCD | Insert a row in the Hive table”

Post 43 | HDPCD | Delete a row in a Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to update a row in a Hive table. In this tutorial, we are going to see how to delete a row in the Hive table. It is quite interesting to see that Hive supports ACID operationsContinue reading “Post 43 | HDPCD | Delete a row in a Hive table”

Post 42 | HDPCD | Update a row in a Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to load the compressed data into a Hive table. In this tutorial, we are going to see how to update a row in the Hive table. It is quite interesting to see that Hive supports ACIDContinue reading “Post 42 | HDPCD | Update a row in a Hive table”

Post 41 | HDPCD | Loading compressed data into a Hive table

Hello, everyone. Thanks for returning for the next tutorial in the HDPCD certification series. In the last tutorial, we saw how to load data into a Hive table from a SELECT query. In this tutorial, we are going to see how to load the compressed data from into the Hive table. Let us begin then. The aboveContinue reading “Post 41 | HDPCD | Loading compressed data into a Hive table”

Post 40 | HDPCD | Load data into hive table as a result of a query

Hi, everyone. Thanks for coming back for one more tutorial in this HDPCD certification series. In the last tutorial, we saw how to load data into a Hive table from an HDFS directory. In this tutorial, we are going to see how to load the data into a Hive table as a result of a Hive query.Continue reading “Post 40 | HDPCD | Load data into hive table as a result of a query”

Post 37 | HDPCD | Specifying delimiter of a Hive table

Hello, everyone. Thanks for coming back for one more tutorial in this HDPCD certification series. In the last tutorial, we saw how to specify the storage format of a Hive table. In this tutorial, we are going to see how to specify the delimiter of a Hive table. We are going to follow the processContinue reading “Post 37 | HDPCD | Specifying delimiter of a Hive table”

Post 25 | HDPCD | Register a Jar file of UDF in Apache Pig

Hello, everyone. Thanks for coming back again to continue with this certification series. In the last tutorial, we saw how to run any pig script with TEZ as the execution mode. In this tutorial, we are going to see how to register a JAR file to use the User Defined Function written and packages inside it.Continue reading “Post 25 | HDPCD | Register a Jar file of UDF in Apache Pig”

Post 14 | HDPCD | Data Transformation to match Hive Schema using Apache Pig

The last tutorial talked about transforming data by reducing the number of columns from input to output records. This tutorial is kind of similar, in which, we are going to take the data transformation process one step further. This tutorial focuses on matching your input records with the Hive table schema. This includes splitting theContinue reading “Post 14 | HDPCD | Data Transformation to match Hive Schema using Apache Pig”

Post 13 | HDPCD | Data Transformation using Apache Pig

In the previous tutorial, we saw how to load the data from Apache Hive to Apache Pig. If you remember, we used HCatalog for performing that operation. In this tutorial, we are going to see the process of doing the data transformation using Apache Pig. The process of data transformation itself is too involved andContinue reading “Post 13 | HDPCD | Data Transformation using Apache Pig”