Post 7 | ML| Data Preprocessing – Part 6

Hello, everyone. Welcome to the last tutorial in the Data Preprocessing portion of the Machine Learning tutorials. In the last tutorial, we saw how to create the TRAINING and TEST data sets for model building purposes. In this tutorial, we are going to see why and how to perform the Feature Scaling. Let us begin, then. To refreshContinue reading “Post 7 | ML| Data Preprocessing – Part 6”

Post 7 | ML | Data Preprocessing – Part 5

Hello, everyone. Thanks for joining me in this 5th tutorial of the Data Preprocessing part of the Machine Learning tutorials. In the last tutorial, we saw how to convert the CATEGORICAL VARIABLES from the STRING format to an INTEGER format. In this tutorial, we are going a step ahead and are going to split the original dataContinue reading “Post 7 | ML | Data Preprocessing – Part 5”

Post 5 | ML | Data Preprocessing – Part 3

Hello, everyone. Thanks for coming back for the third part of the Data Preprocessing section of the Machine Learning tutorial series. In the last tutorial, i.e. Part 2, we saw how to import the downloaded dataset. In this tutorial, we are going to see how to impute the missing data in the input data. TheContinue reading “Post 5 | ML | Data Preprocessing – Part 3”

Post 4 | ML | Data Preprocessing – Part 2

Hello everyone, thanks for coming back to the next tutorial in Data Preprocessing step of Machine Learning tutorials. Just to refresh your memory, in the last tutorial i.e. Part 1 of Data Preprocessing, we saw how to download the dataset and import the required libraries for performing required operations. In this tutorial, we are going to see howContinue reading “Post 4 | ML | Data Preprocessing – Part 2”

Post 3 | ML | Data Preprocessing – Part 1

In the last tutorial, we saw the installation steps for both R and Python along with their respective IDEs. In this tutorial, we are going to start our actual journey of Machine Learning. We are going to start off with the Data Preprocessing part, which is one of the most important aspects of the Machine Learning. WeContinue reading “Post 3 | ML | Data Preprocessing – Part 1”

Post 1 | ML | Introduction

Hello, people. In this new tutorial series, we are going to talk about the different aspects of the Machine Learning. As an aspiring Data Scientist, I always wanted to get my hands dirty with the concepts of Machine Learning and the Summar Break gave me exactly what I wanted – “TIME TO LEARN MACHINE LEARNINGContinue reading “Post 1 | ML | Introduction”

Post 52 | HDPCD | The conclusion

Hi everyone. Finally, we have reached the end of this tutorial series. It’s been so long. We started this journey together on January 15th, 2017, and, 276 days later this beautiful journey is coming to an end. But, we do not need to worry, because, I am working on something new and would love toContinue reading “Post 52 | HDPCD | The conclusion”

Post 51 | HDPCD | Set Hadoop or Hive Configuration property

Hello, everyone. Welcome to the last technical tutorial in the HDPCD certification series. It’s funny! This beautiful journey is coming to an end. In the last tutorial, we saw how to sort the output of a Hive query across multiple reducers. In this tutorial, we are going to see how to set a Hadoop or Hive configurationContinue reading “Post 51 | HDPCD | Set Hadoop or Hive Configuration property”

Post 50 | HDPCD | Order Hive query output across multiple reducers

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to run a subquery within a Hive query. Let us begin, then. The following infographics show the step-by-step process of performing this operation. FromContinue reading “Post 50 | HDPCD | Order Hive query output across multiple reducers”

Post 49 | HDPCD | Using a subquery within a Hive query

Hello, everyone. Welcome to one more tutorial in the HDPCD certification series. In the last tutorial, we saw how to enable vectorization in Hive. In this tutorial, we are going to see how to run a subquery within a Hive query. Let us begin, then. As you can see from the above screenshot, the process of performingContinue reading “Post 49 | HDPCD | Using a subquery within a Hive query”