Post 6 | ML | Data Preprocessing – Part 4

Hello, everyone. Welcome to the Part 4 of the Data Preprocessing of the Machine Learning tutorials. In the last tutorial, we saw how to impute the Missing Data in both Python and R. In this tutorial, we are going to see how to deal with the qualitative entries in the given data. The following infographics show our … Continue reading Post 6 | ML | Data Preprocessing – Part 4

Post 5 | ML | Data Preprocessing – Part 3

Hello, everyone. Thanks for coming back for the third part of the Data Preprocessing section of the Machine Learning tutorial series. In the last tutorial, i.e. Part 2, we saw how to import the downloaded dataset. In this tutorial, we are going to see how to impute the missing data in the input data. The … Continue reading Post 5 | ML | Data Preprocessing – Part 3

Post 4 | ML | Data Preprocessing – Part 2

Hello everyone, thanks for coming back to the next tutorial in Data Preprocessing step of Machine Learning tutorials. Just to refresh your memory, in the last tutorial i.e. Part 1 of Data Preprocessing, we saw how to download the dataset and import the required libraries for performing required operations. In this tutorial, we are going to see how … Continue reading Post 4 | ML | Data Preprocessing – Part 2

Post 2 | Installations – R and Python

Hello, everyone, we are going to start off learning the concepts of Machine Learning. If you are following my blog posts on Hadoop and Big Data Analytics, then you will come to know I do give more importance on performing the hands-on exercises. Same is going to be the case for these tutorials. Here, we … Continue reading Post 2 | Installations – R and Python

Post 52 | HDPCD | The conclusion

Hi everyone. Finally, we have reached the end of this tutorial series. It's been so long. We started this journey together on January 15th, 2017, and, 276 days later this beautiful journey is coming to an end. But, we do not need to worry, because, I am working on something new and would love to … Continue reading Post 52 | HDPCD | The conclusion

Post 48 | HDPCD | Printing the execution plan of a Hive query

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to print the execution plan of a Hive query. Let us begin, then. This is one of the simplest tutorials in this certification series. In … Continue reading Post 48 | HDPCD | Printing the execution plan of a Hive query

Post 43 | HDPCD | Delete a row in a Hive table

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to update a row in a Hive table. In this tutorial, we are going to see how to delete a row in the Hive table. It is quite interesting to see that Hive supports ACID operations … Continue reading Post 43 | HDPCD | Delete a row in a Hive table

Post 41 | HDPCD | Loading compressed data into a Hive table

Hello, everyone. Thanks for returning for the next tutorial in the HDPCD certification series. In the last tutorial, we saw how to load data into a Hive table from a SELECT query. In this tutorial, we are going to see how to load the compressed data from into the Hive table. Let us begin then. The above … Continue reading Post 41 | HDPCD | Loading compressed data into a Hive table

Post 40 | HDPCD | Load data into hive table as a result of a query

Hi, everyone. Thanks for coming back for one more tutorial in this HDPCD certification series. In the last tutorial, we saw how to load data into a Hive table from an HDFS directory. In this tutorial, we are going to see how to load the data into a Hive table as a result of a Hive query. … Continue reading Post 40 | HDPCD | Load data into hive table as a result of a query