Data Science, Analytics and Machine Learning with R, First Edition
- 10h 24m
- Luiz Paulo Favero, Patricia Belfiore, Rafael de Freitas Souza
- Elsevier Science and Technology Books, Inc.
- 2023
Data Science, Analytics and Machine Learning with R explains the principles of data mining and machine learning techniques and accentuates the importance of applied and multivariate modeling. The book emphasizes the fundamentals of each technique, with step-by-step codes and real-world examples with data from areas such as medicine and health, biology, engineering, technology and related sciences. Examples use the most recent R language syntax, with recognized robust, widespread and current packages. Code scripts are exhaustively commented, making it clear to readers what happens in each command. For data collection, readers are instructed how to build their own robots from the very beginning.
In addition, an entire chapter focuses on the concept of spatial analysis, allowing readers to build their own maps through geo-referenced data (such as in epidemiologic research) and some basic statistical techniques. Other chapters cover ensemble and uplift modeling and GLMM (Generalized Linear Mixed Models) estimations, both linear and nonlinear.
- Presents a comprehensive and practical overview of machine learning, data mining and AI techniques for a broad multidisciplinary audience
- Serves readers who are interested in statistics, analytics and modeling, and those who wish to deepen their knowledge in programming through the use of R
- Teaches readers how to apply machine learning techniques to a wide range of data and subject areas
- Presents data in a graphically appealing way, promoting greater information transparency and interactive learning
About the Author
Dr. Fávero is a Full Professor at the Economics, Business Administration and Accounting College and at the Polytechnic School of the University of Sao Paulo (FEAUSP and EPUSP), where he teaches Data Science, Data Analysis, Multivariate Modeling, Machine and Deep Learning and Operational Research to undergraduate, Master’s and Doctorate students. He has a Post-Doctorate degree in Data Analysis and Econometrics from Columbia University in New York. He is a tenured Professor by FEA/USP (with greater focus on Quantitative Modeling). He has a degree in Engineering from USP Polytechnic School, a post-graduate degree in Business Administration from Getúlio Vargas Foundation (FGV/SP), and he has received the titles of Master and PhD in Data Science and Quantitative Methods applied to Organizational Economics from FEA/USP. He is a Visiting Professor at the Federal University of Sao Paulo (UNIFESP), Dom Cabral Foundation, Getúlio Vargas Foundation, FIA, FIPE and MONTVERO. He has authored or co-authored 9 books and he is the founder and former editor-in-chief of the International Journal of Multivariate Data Analysis. He is member and founder of the Latin American Academy of Data Science. He is a consultant to companies operating in sectors such as retail, industry, mining, banks, insurance and healthcare, with the use of Data Analysis, Machine and Deep Learning, Big Data and AI platforms, such as R, Python, SAS, Stata and IBM SPSS. Dr. Fávero is a Full Professor at the Economics, Business Administration and Accounting College and at the Polytechnic School of the University of Sao Paulo (FEAUSP and EPUSP), where he teaches Data Science, Data Analysis, Multivariate Modeling, Machine and Deep Learning and Operational Research to undergraduate, Master’s and Doctorate students. He has a Post-Doctorate degree in Data Analysis and Econometrics from Columbia University in New York. He is a tenured Professor by FEA/USP (with greater focus on Quantitative Modeling). He has a degree in Engineering from USP Polytechnic School, a post-graduate degree in Business Administration from Getúlio Vargas Foundation (FGV/SP), and he has received the titles of Master and PhD in Data Science and Quantitative Methods applied to Organizational Economics from FEA/USP. He is a Visiting Professor at the Federal University of Sao Paulo (UNIFESP), Dom Cabral Foundation, Getúlio Vargas Foundation, FIA, FIPE and MONTVERO. He has authored or co-authored 9 books and he is the founder and former editor-in-chief of the International Journal of Multivariate Data Analysis. He is member and founder of the Latin American Academy of Data Science. He is a consultant to companies operating in sectors such as retail, industry, mining, banks, insurance and healthcare, with the use of Data Analysis, Machine and Deep Learning, Big Data and AI platforms, such as R, Python, SAS, Stata and IBM SPSS.
Dr. Belfiore is Associate Professor at the Federal University of ABC (UFABC), where she teaches Data Science, Statistics, Operational Research, Production Planning and Control, and Programming and Algorithms Development to Engineering students. She has a master’s in electrical engineering and a PhD in production engineering from the Polytechnic School of the University of Sao Paulo (EPUSP). She has a post-doctorate degree in Operational Research and Computer Programming from Columbia University in New York. She takes part in several research and consultancy projects in the fields of modeling, optimization and programming. She has taught Operational Research, Multivariate Data Analysis and Operations Research and Logistics to undergraduate and master’s students at FEI University Center and at the Arts, Sciences and Humanities College of the University of Sao Paulo (EACH/USP). Her main research interests are in the fields of modeling, simulation, combinatorial optimization, heuristics and computer programming. She is the author/co-author of 9 books. She is a consultant to companies operating in sectors such as retail, industry, banks, insurance and healthcare, with the use of Process Simulation and Optimization, Data Analysis, and Machine and Deep Learning platforms, such as R, Python, Stata, IBM SPSS and ProModel.
Dr. Freitas Souza is Assistant Professor at the Economics, Business Administration and Accounting College of Ribeirao Preto of the University of São Paulo (FEARPUSP), where he teaches Programming Languages, Data Science and Analytics, Algorithm Design and Algorithm Development. He has a PhD in Business Management from the Economics, Business Administration and Accounting College of the University of São Paulo (FEAUSP). His main research interests are in the fields of Performance Management (Private and Public sectors) using Multivariate Modeling, Machine and Deep Learning techniques, including Spatial Analysis.
In this Book
-
Epigraph
-
Overview of Data Science, Analytics, and Machine Learning
-
Introduction to R-Based Language
-
Types of Variables, Measurement Scales, and Accuracy Scales
-
Univariate Descriptive Statistics
-
Bivariate Descriptive Statistics
-
Hypotheses Tests
-
Data Visualization and Multivariate Graphs
-
Webscraping and Handcrafted Robots
-
Using Application Programming Interfaces to Collect Data
-
Managing Data
-
Cluster Analysis
-
Principal Component Factor Analysis
-
Simple and Multiple Correspondence Analysis
-
Simple and Multiple Regression Models
-
Binary and Multinomial Logistic Regression Models
-
Count-Data and Zero-Inflated Regression Models
-
Generalized Linear Mixed Models
-
Support Vector Machines
-
Classification and Regression Trees
-
Boosting and Bagging
-
Random Forests
-
Artificial Neural Networks
-
Working on Shapefiles
-
Dealing with Simple Feature Objects
-
Raster Objects
-
Exploratory Spatial Analysis
-
Enhanced and Interactive Graphs
-
Dashboards With R
-
References