Introduction to Supervised Learning | AI Planet (formerly DPhi)

Learning Objectives

What is Machine Learning?
Machine Learning Categorization
Classification and Regression

What is Machine Learning?

Machine Learning Categorization

Supervised Learning Algorithms

Some datasets contain both input variables and a target variable (labels for the data). Examples range from the Titanic dataset, where each passenger's survival outcome is already recorded, to house-price datasets, where the price of each house is provided alongside its characteristics.

The algorithms that work on such datasets are known as Supervised Learning Algorithms.

It is called supervised learning because the process of an algorithm learning from the training dataset can be thought of as a teacher supervising the learning process. We know the correct answers; the algorithm iteratively makes predictions on the training data and is corrected by the teacher. Learning stops when the algorithm achieves an acceptable level of performance.
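The predict-and-correct loop described above can be sketched with a perceptron, one of the simplest supervised learners. This is only an illustration under stated assumptions: the toy data points, the function names, and the choice of a perceptron are not from the course material.

```python
# Minimal perceptron sketch: predict, compare with the known answer, correct.
# The data below is made up purely for illustration.

def predict(weights, bias, x):
    """Return class 1 if the weighted sum is positive, else class 0."""
    activation = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if activation > 0 else 0


def train_perceptron(samples, labels, max_epochs=1000):
    """Repeat passes over the training data until no prediction is wrong."""
    weights = [0.0] * len(samples[0])
    bias = 0.0
    for _ in range(max_epochs):
        mistakes = 0
        for x, target in zip(samples, labels):
            error = target - predict(weights, bias, x)  # the "teacher" signal
            if error != 0:
                # Nudge the model towards the correct answer.
                weights = [w + error * xi for w, xi in zip(weights, x)]
                bias += error
                mistakes += 1
        if mistakes == 0:  # acceptable performance reached: stop learning
            break
    return weights, bias


# Toy labeled dataset: two input features per row, one known label per row.
samples = [[1.0, 1.0], [2.0, 1.5], [1.5, 0.5], [4.0, 4.5], [5.0, 4.0], [4.5, 5.0]]
labels = [0, 0, 0, 1, 1, 1]

weights, bias = train_perceptron(samples, labels)
training_predictions = [predict(weights, bias, x) for x in samples]
print(training_predictions)
```

The stopping rule here (no mistakes on the training data) is just one possible definition of "acceptable performance"; it only works because the toy data can be separated by a straight line.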

Unsupervised Learning Algorithms

Unsupervised learning applies when the dataset is unlabeled, that is, it has no target variable.

Unsupervised Learning Algorithms aim to find some structure in the dataset.

This is called unsupervised learning because, unlike supervised learning, there are no correct answers and there is no teacher. The algorithms are left on their own to discover and present the interesting structure in the data.
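As a rough illustration of finding structure without labels, here is a small one-dimensional k-means sketch. K-means is only one of many unsupervised techniques and is an assumption of this example; the data points and starting centroids are also made up.

```python
# Minimal 1-D k-means sketch: group unlabeled numbers around centroids.

def kmeans_1d(points, centroids, iterations=10):
    """Alternate between assigning points to the nearest centroid and
    moving each centroid to the mean of its assigned points."""
    centroids = list(centroids)
    assignments = []
    for _ in range(iterations):
        # Assignment step: index of the closest centroid for every point.
        assignments = [
            min(range(len(centroids)), key=lambda i: abs(p - centroids[i]))
            for p in points
        ]
        # Update step: move each centroid to the mean of its cluster.
        for i in range(len(centroids)):
            members = [p for p, a in zip(points, assignments) if a == i]
            if members:  # keep the old centroid if a cluster is empty
                centroids[i] = sum(members) / len(members)
    return centroids, assignments


# Unlabeled toy data: there is no target variable here.
points = [1.0, 1.5, 2.0, 9.0, 9.5, 10.0]
centroids, assignments = kmeans_1d(points, centroids=[1.0, 10.0])
print(centroids, assignments)
```

No correct answers are supplied anywhere: the two groups emerge only from how close the points are to each other.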

Reinforcement Learning Algorithms

A robot takes a big step forward, then falls. The next time, it takes a smaller step and is able to hold its balance. The robot tries variations like this many times; eventually, it learns the right size of steps to take and walks steadily. It has succeeded.

What we see here is called reinforcement learning. The robot learns how to walk based on reward (keeping its balance) and punishment (falling). This feedback is considered "reinforcement" for doing or not doing an action.

In simple terms, reinforcement learning is learning the best actions based on reward or punishment.
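The walking robot can be mimicked with a very small trial-and-error loop. The sketch below uses an epsilon-greedy strategy, which is one common approach and not necessarily what a real robot would use; the step sizes, the falling threshold, and the reward values are all hypothetical.

```python
import random

# Hypothetical setup: three candidate step sizes, and the robot falls
# whenever a step is longer than MAX_SAFE_STEP.
STEP_SIZES = [0.2, 0.5, 1.0]
MAX_SAFE_STEP = 0.6


def try_step(step):
    """Reward is the distance covered if balanced, a penalty if the robot falls."""
    return -1.0 if step > MAX_SAFE_STEP else step


def learn_step_size(trials=500, epsilon=0.2, seed=0):
    """Epsilon-greedy learning: mostly repeat the best-known step,
    occasionally explore a random one."""
    rng = random.Random(seed)
    values = {s: 0.0 for s in STEP_SIZES}  # estimated reward per step size
    counts = {s: 0 for s in STEP_SIZES}
    for _ in range(trials):
        if rng.random() < epsilon:
            step = rng.choice(STEP_SIZES)  # explore
        else:
            step = max(STEP_SIZES, key=lambda s: values[s])  # exploit
        reward = try_step(step)
        counts[step] += 1
        # Running average of the rewards seen for this step size.
        values[step] += (reward - values[step]) / counts[step]
    best = max(STEP_SIZES, key=lambda s: values[s])
    return best, values


best_step, values = learn_step_size()
print(best_step, values)
```

The largest step is punished, the smallest is safe but slow, and the learner settles on the step size with the highest estimated reward.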

Types of Supervised Learning Algorithms

Supervised learning can be further divided into two types:

Classification
Regression

Classification vs Regression

To decide whether to use a regression or classification model, the first question you should ask yourself is:

Does your target variable have a continuous value, or is it discrete (binary or multi-class)?
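That question can be turned into a rough helper that inspects the target values. This is only a heuristic sketch: the function name `suggest_task` and the `max_classes` threshold are assumptions of this example, and a real decision should also take the meaning of the variable into account.

```python
from numbers import Real


def suggest_task(target_values, max_classes=10):
    """Guess the problem type from the values of the target variable."""
    distinct = set(target_values)
    # Treat booleans as categories, not as numbers.
    numeric = all(isinstance(v, Real) and not isinstance(v, bool) for v in distinct)
    whole = numeric and all(float(v).is_integer() for v in distinct)
    # Numeric targets count as continuous unless they are a small set of whole numbers.
    if numeric and not (whole and len(distinct) <= max_classes):
        return "regression"
    if len(distinct) == 2:
        return "binary classification"
    return "multi-class classification"


print(suggest_task([21.5, 19.0, 23.7, 18.2]))             # continuous values
print(suggest_task([0, 1, 1, 0]))                         # two discrete classes
print(suggest_task(["Good", "Average", "Bad", "Good"]))   # three discrete classes
```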

Regression

If the target variable takes continuous values, you're dealing with Regression.

If you're trying to predict quantities like height, income, price, or scores, you should use a model that will output a continuous number.

So if your objective is to determine tomorrow's temperature, you should use a regression model.
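As a minimal sketch of a regression model, the example below fits a straight line with ordinary least squares and uses it to output a continuous number. The temperature readings are invented for illustration, and predicting tomorrow's temperature from today's alone is a deliberate simplification.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one input variable: y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    variance = sum((x - mean_x) ** 2 for x in xs)
    slope = covariance / variance
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up training data: today's temperature (input) and the next day's (target).
today = [10.0, 12.0, 14.0, 16.0]
tomorrow = [11.0, 12.0, 15.0, 16.0]

slope, intercept = fit_line(today, tomorrow)
prediction = slope * 18.0 + intercept  # continuous output for a new input
print(slope, intercept, prediction)
```

The prediction is a real number on a continuous scale, which is what distinguishes regression from classification.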

Classification

In the second case, the target variable is divided into discrete classes, and you'll be using Classification.

When the number of classes is 2, it is known as Binary Classification. E.g., predicting whether tomorrow will be hot or cold is a binary classification problem with two categories: Hot and Cold.
When there are more than 2 classes, it is known as Multi-Class Classification. E.g., classifying movies as Good, Average, or Bad according to reviews.
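The movie example can be sketched with a nearest-centroid classifier, chosen here only because it is short; it is not the course's prescribed method. The review scores and the idea of summarizing reviews as a single average score are assumptions made for illustration.

```python
def fit_centroids(scores, labels):
    """Compute the mean review score of each class from labeled examples."""
    grouped = {}
    for score, label in zip(scores, labels):
        grouped.setdefault(label, []).append(score)
    return {label: sum(vals) / len(vals) for label, vals in grouped.items()}


def classify(centroids, score):
    """Assign the class whose mean score is closest to the given score."""
    return min(centroids, key=lambda label: abs(score - centroids[label]))


# Made-up labeled data: average review score (0 to 10) and the movie's class.
scores = [8.5, 9.0, 5.0, 5.5, 2.0, 1.5]
labels = ["Good", "Good", "Average", "Average", "Bad", "Bad"]

centroids = fit_centroids(scores, labels)
for new_score in (8.0, 4.9, 1.0):
    print(new_score, classify(centroids, new_score))
```

The output is always one of a fixed set of class names rather than a number on a continuous scale. With only two labels in the training data, the same code would perform binary classification.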

In the accompanying image, the classification line divides the data into two classes, red and blue. The regression line, on the other hand, follows the trend of the data rather than separating it.

Understanding your target variable's characteristics is essential before you begin running models and forming predictions.

In this course, we'll be focusing on Regression.
