Data Preparation 101
Learn data preparation steps required for machine learning model building
16 Tutorials
0 Exercises
Beginner Level
100% Online
Self-paced
Our Alumni Work At
About this course
Contributors & Instructors
What you will learn?
Course Content
1. Introduction
2
Data Preparation - What and Why
Understanding and Assessing the Data
2. Hands-on Data Preparation
6
Exploring the Data
Assessing the Data Quality for Numeric Variables
Using Pandas GUI for Assessing Data Quality
Assessing the Data Quality for Categorical Variables
Data Cleaning - Data Types and Missing Values
Data Cleaning - Outliers
3. Other Parts of Data Preparation
5
Duplicate Data
Feature Engineering
Another Example Dataset
Train, Validation and Test Split
Class Imbalance
4. Additional Resources
3
Past DPhi Challenges on Data Preparation
Video Slides
Notebook