Post 5 | ML | Data Preprocessing – Part 3

Hello, everyone. Thanks for coming back for the third part of the Data Preprocessing section of the Machine Learning tutorial series. In the last tutorial, i.e. Part 2, we saw how to import the downloaded dataset. In this tutorial, we are going to see how to impute the missing data in the input data. The...

Post 52 | HDPCD | The conclusion

Hi everyone. Finally, we have reached the end of this tutorial series. It’s been so long. We started this journey together on January 15th, 2017, and 276 days later, this beautiful journey is coming to an end. But we do not need to worry, because I am working on something new and would love to...

Post 48 | HDPCD | Printing the execution plan of a Hive query

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to print the execution plan of a Hive query. Let us begin, then. This is one of the simplest tutorials in this certification series. In...

Post 1 | Machine Learning | Introduction

Hello, people. In this new tutorial series, we are going to talk about the different aspects of Machine Learning. As an aspiring Data Scientist, I always wanted to get my hands dirty with the concepts of Machine Learning, and the summer break gave me exactly what I wanted – “TIME TO LEARN MACHINE LEARNING...