Data Science For Dummies
- 5h 44m
- Lillian Pierson
- John Wiley & Sons (US)
- 2015
Jobs in data science abound, but few people have the data science skills needed to fill these increasingly important roles in organizations. Data Science For Dummies is the perfect starting point for IT professionals and students interested in making sense of their organization’s massive data sets and applying their findings to real-world business scenarios. From uncovering rich data sources to managing large amounts of data within hardware and software limitations, ensuring consistency in reporting, merging various data sources, and beyond, you’ll develop the know-how you need to effectively interpret data and tell a story that can be understood by anyone in your organization.
- Provides a background in data science fundamentals before moving on to working with relational databases and unstructured data and preparing your data for analysis
- Details different data visualization techniques that can be used to showcase and summarize your data
- Explains both supervised and unsupervised machine learning, including regression, model validation, and clustering techniques
- Includes coverage of big data processing tools like MapReduce, Hadoop, Dremel, Storm, and Spark
It’s a big, big data world out there – let Data Science For Dummies help you harness its power and gain a competitive edge for your organization.
About the Author
Lillian Pierson, P.E. is an entrepreneurial data scientist and professional environmental engineer. She's the founder of Data-Mania, a start-up that focuses mainly on web analytics, data-driven growth services, data journalism, and data science training services. She also covers the topics of data science, analytics, and statistics for prominent organizations like IBM and UBM.
In this Book
-
Foreword
-
Introduction
-
Wrapping Your Head around Data Science
-
Exploring Data Engineering Pipelines and Infrastructure
-
Applying Data Science to Business and Industry
-
Introducing Probability and Statistics
-
Clustering and Classification
-
Clustering and Classification with Nearest Neighbor Algorithms
-
Mathematical Modeling in Data Science
-
Modeling Spatial Data with Statistics
-
Following the Principles of Data Visualization Design
-
Using D3.js for Data Visualization
-
Web-Based Applications for Visualization Design
-
Exploring Best Practices in Dashboard Design
-
Making Maps from Spatial Data
-
Using Python for Data Science
-
Using Open Source R for Data Science
-
Using SQL in Data Science
-
Software Applications for Data Science
-
Using Data Science in Journalism
-
Delving into Environmental Data Science
-
Data Science for Driving Growth in E-Commerce
-
Using Data Science to Describe and Predict Criminal Activity
-
Ten Phenomenal Resources for Open Data
-
Ten (or So) Free Data Science Tools and Applications