Data Science Statistics: Using Python to Compute & Visualize Statistics

Data Science    |    Beginner
  • 10 videos | 1h 15m 56s
  • Includes Assessment
  • Earns a Badge
Rating 4.6 of 62 users
Learners continue their exploration of data science in this 10-video course, which uses the NumPy, Pandas, and SciPy libraries to perform statistical summary operations on real data sets. This beginner-level course assumes some prior experience with Python programming and an understanding of basic statistical concepts such as mean, standard deviation, and correlation. The course opens by exploring different ways to visualize data with the Matplotlib library, including univariate and bivariate distributions. Next, you will compute descriptive statistics for distributions, such as variance and standard error, by using the NumPy, Pandas, and SciPy libraries. You will learn about the z-score, in which every value in a distribution is expressed as the number of standard deviations it lies from the mean, and then compute z-scores for a series using SciPy. In the closing exercise, you will use the Matplotlib data visualization library to plot a graph through three points given by their coordinates, then enumerate the details conveyed in a boxplot.
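
As a rough illustration of the z-score idea described above, the sketch below standardizes a small made-up sample by hand and compares the result with `scipy.stats.zscore`. The sample values and variable names are assumptions for illustration only and are not taken from the course material.

```python
import numpy as np
from scipy import stats

# Made-up sample data (not from the course).
values = np.array([2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0])

# Manual z-score: distance of each value from the mean,
# measured in standard deviations. ddof=0 (population standard
# deviation) matches the default used by scipy.stats.zscore.
manual_z = (values - values.mean()) / values.std(ddof=0)

# The same computation using SciPy.
scipy_z = stats.zscore(values)

print(manual_z)
print(scipy_z)
```

For this sample the mean is 5 and the population standard deviation is 2, so the value 9 has a z-score of 2 and the value 2 has a z-score of -1.5.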

WHAT YOU WILL LEARN

  • Create and configure simple graphs with lines and markers using the Matplotlib data visualization library
  • Use the NumPy library to manipulate arrays and the Pandas library to load and analyze a dataset
  • Generate histograms and pie charts to analyze distributions and create scatter plots to plot the relationship between two variables in a dataset
  • Apply native Python functions such as max() and sum() to summarize distributions and visualize these values using Matplotlib
  • Use NumPy to compute statistics such as the mean and median on your data
  • Calculate statistics such as the mode and standard error of the mean using the SciPy library and compute more statistics such as variance and values at various percentiles using NumPy
  • Use NumPy to compute the correlation and covariance of two distributions and visualize their relationship with scatter plots
  • Standardize a distribution to express its values as z-scores and use Pandas to generate a correlation and covariance matrix for your dataset
  • Create and configure a graph using Matplotlib, enumerate the details conveyed in a boxplot, compute statistical values using NumPy functions, and compute the correlations between all pairs of columns in a Pandas dataframe

IN THIS COURSE

  • 2m 18s
  • 9m 17s
    In this video, you will learn how to create and configure simple graphs with lines and markers using the Matplotlib data visualization library.
  • 3. Analyzing Data Using NumPy and Pandas
    7m 59s
    In this video, you will learn how to use the NumPy library to manipulate arrays and the Pandas library to load and analyze data sets.
  • 4. Visualizing Univariate and Bivariate Distributions
    9m 37s
    In this video, you will learn how to generate histograms and pie charts to analyze distributions, and create scatter plots to plot the relationship between two variables in a dataset.
  • 5. Summary Statistics Using Native Python Functions
    8m 41s
    In this video, you will apply native Python functions such as max() and sum() to summarize distributions and visualize these values using Matplotlib.
  • 6. Summary Statistics Using NumPy
    9m 12s
    In this video, you will use NumPy to compute statistics such as the mean and median.
  • 7. Summary Statistics Using the SciPy Library
    8m 16s
    In this video, you will calculate statistics such as the mode and standard error of the mean using the SciPy library and compute more statistics such as variance and values at various percentiles using NumPy.
  • 8. Correlation and Covariance
    8m 1s
    In this video, you will learn how to use NumPy to compute the correlation and covariance of two distributions and visualize their relationship with scatter plots.
  • 9. Z-score
    6m 57s
    In this video, you will standardize a distribution to express its values as z-scores and use Pandas to generate a correlation matrix and a covariance matrix for your dataset.
  • 10. Exercise: Compute and Visualize Statistics
    5m 39s
    In this video, you will create and configure a graph using Matplotlib, enumerate the details conveyed in a boxplot, compute statistical values using NumPy functions, and compute the correlations between all pairs of columns in a Pandas dataframe.
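
The correlation and covariance topics listed above might look something like the following sketch. The dataframe contents and column names are invented for illustration and do not come from the course; only standard NumPy and Pandas calls are used.

```python
import numpy as np
import pandas as pd

# Made-up dataset (not from the course).
df = pd.DataFrame({
    "x": [1.0, 2.0, 3.0, 4.0, 5.0],
    "y": [2.0, 4.0, 6.0, 8.0, 10.0],  # rises exactly with x
    "z": [5.0, 4.0, 3.0, 2.0, 1.0],   # falls exactly as x rises
})

# NumPy: correlation and covariance of two distributions.
# Both functions return a 2x2 matrix; the off-diagonal entry
# is the value for the pair.
pair_corr = np.corrcoef(df["x"], df["y"])[0, 1]
pair_cov = np.cov(df["x"], df["y"])[0, 1]  # sample covariance (ddof=1)

# Pandas: correlation and covariance between all pairs of columns.
corr_matrix = df.corr()
cov_matrix = df.cov()

print(corr_matrix)
print(cov_matrix)
```

Because `y` is an exact multiple of `x`, their correlation is 1, while `x` and `z` have a correlation of -1. Real data sets will rarely show values this extreme.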

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft gives you the opportunity to earn a digital badge upon successful completion of some of our courses. Badges can be shared on any social network or business platform.

Digital badges are yours to keep, forever.
