Discovering Knowledge in Data: An Introduction to Data Mining, Second Edition
- 5h 27m
- Chantal D. Larose, Daniel T. Larose
- John Wiley & Sons (US)
- 2014
The field of data mining lies at the confluence of predictive analytics, statistical analysis, and business intelligence. Due to the ever-increasing complexity and size of data sets and the wide range of applications in computer science, business, and health care, the process of discovering knowledge in data is more relevant than ever before.
This book provides the tools needed to thrive in today’s big data world. The author demonstrates how to leverage a company’s existing databases to increase profits and market share, and carefully explains the most current data science methods and techniques. The reader will “learn data mining by doing data mining”. By adding chapters on data modelling preparation, imputation of missing data, and multivariate statistical analysis, Discovering Knowledge in Data, Second Edition remains the eminent reference on data mining.
- The second edition of a highly praised, successful reference on data mining, with thorough coverage of big data applications, predictive analytics, and statistical analysis.
- Includes new chapters on Multivariate Statistics, Preparing to Model the Data, and Imputation of Missing Data, and an Appendix on Data Summarization and Visualization
- Offers extensive coverage of the R statistical programming language
- Contains 280 end-of-chapter exercises
About the Authors
Daniel T. Larose earned his PhD in Statistics at the University of Connecticut. He is Professor of Mathematical Sciences and Director of the Data Mining programs at Central Connecticut State University. His consulting clients have included Microsoft, Forbes Magazine, the CIT Group, KPMG International, Computer Associates, and Deloitte, Inc. This is Larose’s fourth book for Wiley.
Chantal D. Larose is a PhD candidate in Statistics at the University of Connecticut. Her research focuses on the imputation of missing data and model-based clustering. She has taught undergraduate statistics since 2011, and has done statistical consulting for DataMiningConsultant.com, LLC.
In this Book
-
An Introduction To Data Mining
-
Data Preprocessing
-
Exploratory Data Analysis
-
Univariate Statistical Analysis
-
Multivariate Statistics
-
Preparing To Model the Data
-
K-Nearest Neighbor Algorithm
-
Decision Trees
-
Neural Networks
-
Hierarchical and k-Means Clustering
-
Kohonen Networks
-
Association Rules
-
Imputation of Missing Data
-
Model Evaluation Techniques