Handling Missing Values and Outliers in Data
Machine Learning
| Intermediate
- 13 videos | 1h 22m 6s
- Earns a Badge
Managing data quality involves mastering techniques to address missing values and outliers to make datasets clean and reliable. This process includes utilizing strategies such as multivariate imputation, outlier detection, and visualization. In this course, learn the fundamentals of data preparation, identify missing values and outliers in datasets, and explore strategies to detect and manage them effectively. Next, discover how to load and prepare data for cleaning, address missing data by dropping records or imputing values, and implement multivariate imputation to achieve more reliable datasets. Finally, explore methods to identify and visualize outliers using box plots and utilize the interquartile range (IQR) and z-score techniques to detect and cap outliers. After taking this course, you will be able to handle missing values and outliers in data.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseDescribe data preprocessing and feature engineering in machine learning (ml) workflowsIdentify how to handle missing values and outliers in data analysisRecognize the impact of outliers on statistical and machine learning (ml) modelsOutline various methods for detecting, identifying, and managing outliersLoad in a dataset and observe missing values and duplicatesPerform data cleaning using dropna and the simpleimputer for handling missing values
-
Implement multivariate imputation using the iterativeimputer classDetect and cap outliers using interquartile ranges and box plotsIdentify outliers using the interquartile range (iqr) methodDetect and cap outliers using iqrUse the z-score technique to identify and handle outliersSummarize the key concepts covered in this course
IN THIS COURSE
-
58sIn this video, we will discover the key concepts covered in this course. FREE ACCESS
-
8m 1sAfter completing this video, you will be able to describe data preprocessing and feature engineering in machine learning (ML) workflows. FREE ACCESS
-
8m 12sUpon completion of this video, you will be able to identify how to handle missing values and outliers in data analysis. FREE ACCESS
-
6m 28sThrough this video, you will be able to recognize the impact of outliers on statistical and machine learning (ML) models. FREE ACCESS
-
5m 25sIn this video, we will outline various methods for detecting, identifying, and managing outliers. FREE ACCESS
-
8m 4sLearn how to load in a dataset and observe missing values and duplicates. FREE ACCESS
-
10m 46sIn this video, find out how to perform data cleaning using dropna and the SimpleImputer for handling missing values. FREE ACCESS
-
5m 14sDiscover how to implement multivariate imputation using the IterativeImputer class. FREE ACCESS
-
7m 28sIn this video, you will learn how to detect and cap outliers using interquartile ranges and box plots. FREE ACCESS
-
7m 5sFind out how to identify outliers using the interquartile range (IQR) method. FREE ACCESS
-
5m 36sIn this video, discover how to detect and cap outliers using IQR. FREE ACCESS
-
7m 23sIn this video, learn how to use the z-score technique to identify and handle outliers. FREE ACCESS
-
1m 26sIn this video, we will summarize the key concepts covered in this course. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.YOU MIGHT ALSO LIKE

Audiobook
Feature Engineering Bookcamp