Learning Objectives
- Need for Explainable AI
- What is Explainable AI?
- The Black Box
- What is causing the transition?
- Model Interpretation
- Explainable AI constituents
Need for Explainable AI
Let’s say you are a renowned doctor, and a Data Scientist comes to you and says:
“Hi, I’ve built a model to predict whether or not a person has skin cancer. You should set it up in your hospital.” Would you completely trust this person?
Many questions might arise in your mind:
- On what basis does this model classify an image as cancerous?
- How reliable is it?
- Will it be more helpful than the cancer experts we have here?
Even the Data Scientist might not be able to answer your questions satisfactorily. So, what should we do to judge how trustworthy a model is? Can we help the Data Scientist out?
Explainable AI (XAI)
Here comes Explainable AI!
As the name suggests, it explains what is actually happening behind those fancy Artificial Intelligence models.
A formal definition:
“Explainable AI is a set of tools and frameworks to help you understand and interpret predictions made by your machine learning models. With it, you can debug and improve model performance, and help others understand your models’ behavior”.
[Figure: How Explainable AI makes things easier]
The Black Box
Since AI began its path to mainstream adoption, it has been common practice to train machine learning models with a black-box approach: as long as they give us good results, it doesn’t matter that we can’t explain them.
Now, the focus is slowly turning towards the transparency and explainability of these models. We want to know WHY they come up with the predictions they output.
Explainable AI is one of the ‘hottest’ topics in the field of Machine Learning nowadays.
What is causing the transition?
This transition is happening for several reasons:
- Understanding what happens when Machine Learning models make predictions could help speed up the widespread adoption of these systems. New technologies always take time to mature, but it helps if they are understood.
- It makes users increasingly comfortable with the technology and removes the magical veil surrounding AI. Having users who trust the systems they are using is of utmost importance.
- For some sectors like insurance or banking, there are sometimes company-level or even legislative restrictions that make it a must for the models that these companies use to be explainable.
- In some other critical areas, like medicine, where AI can have a significant impact and make tremendous improvements to our quality of life, people need to be able to trust the models without a hint of doubt. A Netflix recommendation system that sometimes outputs strange predictions might not have a huge impact, but in medical diagnosis, abnormal or wrong predictions could be fatal. Hence, providing more information than just the prediction allows users to decide whether to trust it.
- Explainable models help their users make better use of the outputs these models give, increasing their impact on business, research, or decision making. We should never forget that, like any other technology, AI aims to improve our quality of life, so the more benefit we can extract from it, the better.
Model Interpretation
Model Interpretation can make or break a machine learning project in the real world. It helps us get closer to explainable artificial intelligence (XAI).
Interpretability is the degree to which a human can understand the cause of a decision.
Model interpretation tries to understand and explain the decisions taken by the model, i.e., the what, why, and how. The three most important aspects of model interpretation are:
- Transparency
- The ability to question
- The ease of understanding
What does it mean to interpret a model?
Let’s start by precisely defining what it means to interpret a model. At a very high level, we want to understand what motivated a certain prediction.
For instance, let’s use the problem from the XGBoost documentation, where, given the age, gender, and occupation of an individual, we want to predict whether or not they will like computer games:
In this case, the input features are age, gender, and occupation. We want to know how these features impacted the model’s prediction that someone would like computer games.
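As a minimal sketch of this setup, the snippet below trains an XGBoost model on a tiny hand-made dataset; the data and the integer encoding of gender and occupation are illustrative assumptions, not the actual example data from the XGBoost documentation:

```python
import numpy as np
import xgboost as xgb

# Toy, hand-made data (illustrative only); each row is one person.
# Columns: age, gender (integer-encoded), occupation (integer-encoded).
X = np.array([[12, 0, 0],
              [15, 1, 0],
              [45, 0, 1],
              [50, 1, 2],
              [23, 1, 3],
              [34, 0, 2]])
# Label: 1 if the person likes computer games, 0 otherwise.
y = np.array([1, 1, 0, 0, 1, 0])

dtrain = xgb.DMatrix(X, label=y,
                     feature_names=["age", "gender", "occupation"])
booster = xgb.train({"objective": "binary:logistic", "max_depth": 2},
                    dtrain, num_boost_round=20)

# Predicted probability that each person likes computer games.
print(booster.predict(dtrain))
```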
Global and Local Interpretation
However, there are two different ways to interpret this:
- On a global level. Which features did the algorithm find most predictive across the entire dataset? XGBoost’s get_score() function, which counts how many times a feature was used to split the data, is an example of global feature importance, since it looks at what was learned from all the data (see the code sketch after this list).
- On a local level. Maybe, across all individuals, age was the most important feature, and younger people are much more likely to like computer games. But if Frank is a 50-year-old who works as a video game tester, his occupation will likely be much more significant than his age in determining whether he likes computer games. Identifying which features were most important for Frank means finding feature importances on a ‘local’ (individual) level.
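To make both levels concrete, here is a short sketch built on the same illustrative toy setup as above. get_score() reports XGBoost’s global split-count importance, while predict(..., pred_contribs=True) returns per-row, SHAP-style contributions that give a local view; the individual “Frank” and his feature encoding are invented for illustration:

```python
import numpy as np
import xgboost as xgb

# Same illustrative toy setup as in the earlier sketch.
feature_names = ["age", "gender", "occupation"]
X = np.array([[12, 0, 0], [15, 1, 0], [45, 0, 1],
              [50, 1, 2], [23, 1, 3], [34, 0, 2]])
y = np.array([1, 1, 0, 0, 1, 0])
dtrain = xgb.DMatrix(X, label=y, feature_names=feature_names)
booster = xgb.train({"objective": "binary:logistic", "max_depth": 2},
                    dtrain, num_boost_round=20)

# Global view: how many times each feature was used to split the data
# across all trees ("weight" importance).
print(booster.get_score(importance_type="weight"))

# Local view: feature contributions for a single, invented individual
# ("Frank", a 50-year-old video game tester; occupation code 3 is just
# an illustrative encoding). pred_contribs=True returns one column per
# feature plus a final bias column, on the log-odds scale.
frank = xgb.DMatrix(np.array([[50, 1, 3]]), feature_names=feature_names)
contribs = booster.predict(frank, pred_contribs=True)
for name, value in zip(feature_names + ["bias"], contribs[0]):
    print(f"{name}: {value:+.3f}")
```

If occupation receives a large contribution for this row while age does not, that mirrors the intuition above: locally, Frank’s job matters more than his age, even if age dominates globally.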