Statistics for Big Data for Dummies
- 5h 1m
- Alan Anderson, David Semmelroth
- John Wiley & Sons (US)
- 2015
Learn to:
- Collect, clean, and interpret data
- Effectively communicate data analysis
- Make good predictions
Big data making you dizzy? Relax—here's what it's all about
Big data figures into everything from weather forecasting to political polling. Don't let it give you a big headache; use this friendly book to learn about it in manageable, bite-size chunks. You'll get a handle on the statistical methods used when working with big data, applications for it, ways to organize and check data, and a whole lot more.
- Solving the big mystery—find out what big data is, characteristics that define it, how it's used, and what it makes possible
- How to handle it — explore statistical techniques used with big data, including probability distributions, regression analysis, time series analysis, and forecasting techniques
- Getting graphical — learn how big data can be analyzed with graphical techniques and how to identify valid, useful, and understandable patterns in data
- A variable approach — examine key univariate and multivariate statistical techniques for analyzing data
- Thinking ahead — discover techniques for forecasting the future values of a dataset
- There's a tool for that — learn about the best software packages and programming tools for analyzing statistical data
Open the book and find:
- Ways to extract previously unknown information from a database
- Tips for data collection and cleaning
- Techniques for analyzing time series data
- How to check data for missing information
- What to do with outliers in a dataset
- Some surprising uses for big data
- An overview of modeling techniques
About the Authors
Alan Anderson, PhD, is a professor of economics and finance at Fordham University and New York University. He's a veteran economist, risk manager, and fixed income analyst.
David Semmelroth is an experienced data analyst, trainer, and statistics instructor who consults on customer databases and database marketing.
In this Book
- Introduction
- What is Big Data and What Do You Do with It?
- Characteristics of Big Data—The Three Vs
- Using Big Data—The Hot Applications
- Understanding Probabilities
- Basic Statistical Ideas
- Dirty Work—Preparing Your Data for Analysis
- Figuring the Format—Important Computer File Formats
- Checking Assumptions—Testing for Normality
- Dealing with Missing or Incomplete Data
- Sending Out a Posse—Searching for Outliers
- An Overview of Exploratory Data Analysis (EDA)
- A Plot to Get Graphical—Graphical Techniques
- You're the Only Variable for Me—Univariate Statistical Techniques
- To All the Variables We've Encountered—Multivariate Statistical Techniques
- Regression Analysis
- When You've Got the Time—Time Series Analysis
- Using Your Crystal Ball—Forecasting with Big Data
- Crunching Numbers—Performing Statistical Analysis on Your Computer
- Seeking Free Sources of Financial Data
- Ten (or So) Best Practices in Data Preparation
- Ten (or So) Questions Answered by Exploratory Data Analysis (EDA)