Building ML Training Sets: Preprocessing Datasets for Classification
Machine Learning
| Beginner
- 6 videos | 43m 46s
- Includes Assessment
- Earns a Badge
In this course, learners can explore how to implement machine learning scaling techniques such as standardizing and normalizing on continuous data and label encoding on the target, in order to get the best out of machine learning algorithms. Examine dimensionality reduction by using Principal Component Analysis (PCA). Start this 6-video course by using Pandas library to load a CSV data set into a data frame and scale continuous features by using a standard scaler. You will then learn how to build and evaluate a support vector classifier in scikit-learn; use Pandas and Seaborn to generate a heat map; and spot the correlations between features in a data set. Discover how to apply the technique of PCA to reduce the number of dimensions in your input data and obtain the explained variance of each principal component. In the course's final tutorial, you will explore how to apply normalization and PCA on data sets and build a classification model with the principal components of scaled data. The concluding exercise involves processing data for classification.
WHAT YOU WILL LEARN
-
Use the pandas library to load a csv dataset into a dataframe and scale the continuous features using a standard scalerBuild and evaluate a support vector classifier in scikit-learn, use pandas and seaborn to generate a heatmap, and spot the correlations between features in a datasetApply the technique of principal component analysis to reduce the number of dimensions in your input data and obtain the explained variance of each principal component
-
Apply normalization and pca on a dataset and build a classification model with the principal components of scaled dataEncode the target column of a dataset containing certain values, identify the features of normalization, enumerate reasons for using pca, split data into training and test sets using scikit-learn, identify one method of viewing correlations in a dataset using pandas and seaborn
IN THIS COURSE
-
2m 54s
-
8m 13sDuring this video, you will learn how to use the Pandas library to load a CSV dataset into a dataframe and scale the continuous features using a standard scaler. FREE ACCESS
-
6m 42sLearn how to build and evaluate a support vector classifier in scikit-learn, use Pandas and Seaborn to generate a heatmap, and identify the correlations between features in a dataset. FREE ACCESS
-
7m 33sIn this video, you will learn how to apply the technique of Principal Component Analysis to reduce the number of dimensions in your input data and obtain the explained variance of each principal component. FREE ACCESS
-
9m 22sDuring this video, you will learn how to apply normalization and PCA to a dataset and build a classification model with the principal components of scaled data. FREE ACCESS
-
9m 2sFind out how to encode the target column of a dataset containing certain values, identify the features of Normalization, enumerate reasons for using PCA, split data into training and test sets using scikit-learn, identify one method of viewing correlations in a dataset using Pandas and Seaborn. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.YOU MIGHT ALSO LIKE
Audiobook
Ensemble Methods for Machine Learning