Hello, everyone. Thanks for coming back for the third part of the Data Preprocessing section of the Machine Learning tutorial series. In the last tutorial, i.e. Part 2, we saw how to import the downloaded dataset. In this tutorial, we are going to see how to impute the missing data in the input data. The … Continue reading Post 5 | ML | Data Preprocessing – Part 3
Tag: Data Loader and Transformer
Post 4 | ML | Data Preprocessing – Part 2
Hello everyone, thanks for coming back to the next tutorial in Data Preprocessing step of Machine Learning tutorials. Just to refresh your memory, in the last tutorial i.e. Part 1 of Data Preprocessing, we saw how to download the dataset and import the required libraries for performing required operations. In this tutorial, we are going to see how … Continue reading Post 4 | ML | Data Preprocessing – Part 2
Post 52 | HDPCD | The conclusion
Hi everyone. Finally, we have reached the end of this tutorial series. It's been so long. We started this journey together on January 15th, 2017, and, 276 days later this beautiful journey is coming to an end. But, we do not need to worry, because, I am working on something new and would love to … Continue reading Post 52 | HDPCD | The conclusion
Post 51 | HDPCD | Set Hadoop or Hive Configuration property
Hello, everyone. Welcome to the last technical tutorial in the HDPCD certification series. It's funny! This beautiful journey is coming to an end. In the last tutorial, we saw how to sort the output of a Hive query across multiple reducers. In this tutorial, we are going to see how to set a Hadoop or Hive configuration … Continue reading Post 51 | HDPCD | Set Hadoop or Hive Configuration property
Post 50 | HDPCD | Order Hive query output across multiple reducers
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to run a subquery within a Hive query. Let us begin, then. The following infographics show the step-by-step process of performing this operation. From … Continue reading Post 50 | HDPCD | Order Hive query output across multiple reducers
Post 49 | HDPCD | Using a subquery within a Hive query
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to run a subquery within a Hive query. Let us begin, then. As you can see from the above screenshot, the process of performing … Continue reading Post 49 | HDPCD | Using a subquery within a Hive query
Post 48 | HDPCD | Printing the execution plan of a Hive query
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to print the execution plan of a Hive query. Let us begin, then. This is one of the simplest tutorials in this certification series. In … Continue reading Post 48 | HDPCD | Printing the execution plan of a Hive query
Post 47 | HDPCD | Run a Hive query using Vectorization
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to run a Hive Query using TeZ execution engine. In this tutorial, we are going to see how to run a Hive Query using Vectorization. Let us begin, then. Before starting off with the objective of this tutorial, let … Continue reading Post 47 | HDPCD | Run a Hive query using Vectorization
Post 45 | HDPCD | Join two Hive tables
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to insert a new row into a Hive table. In this tutorial, we are going to see how to join two Hive tables. Let us begin, then. The above info-graphics show the step by step process … Continue reading Post 45 | HDPCD | Join two Hive tables
Post 44 | HDPCD | Insert a row in the Hive table
Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to delete a row in a Hive table. In this tutorial, we are going to see how to insert a row in the Hive table. It is quite interesting to see that Hive supports ACID operations … Continue reading Post 44 | HDPCD | Insert a row in the Hive table