CRAN Recipes: DPLYR, Stringr, Lubridate, and RegEx in R
- 2h 56m
- William Yarberry
- Apress
- 2021
Want to use the power of R sooner rather than later? Don’t have time to plow through wordy texts and online manuals? Use this book for quick, simple code to get your projects up and running. It includes code and examples applicable to many disciplines. Written in everyday language with a minimum of complexity, each chapter provides the building blocks you need to fit R’s astounding capabilities to your analytics, reporting, and visualization needs.
CRAN Recipes recognizes how needless jargon and complexity get in your way. Busy professionals need simple examples and intuitive descriptions; side trips and meandering philosophical discussions are left for other books.
Here R scripts are condensed, to the extent possible, to copy-paste-run format. Chapters and examples are structured to purpose rather than particular functions (e.g., “dirty data cleanup” rather than the R package name “janitor”). Everyday language eliminates the need to know functions/packages in advance.
What You Will Learn
- Carry out input/output; visualizations; data munging; manipulations at the group level; and quick data exploration
- Handle forecasting (multivariate, time series, logistic regression, Facebook’s Prophet, and others)
- Use text analytics; sampling; financial analysis; and advanced pattern matching (regex)
- Manipulate data using DPLYR: filter, sort, summarize, add new fields to datasets, and apply powerful IF functions
- Create combinations or subsets of files using joins
- Write efficient code using pipes to eliminate intermediate steps (MAGRITTR)
- Work with string/character manipulation of all types (STRINGR)
- Discover counts, patterns, and how to locate whole words
- Do wild-card matching, extraction, and invert-match
- Work with dates using LUBRIDATE
- Fix dirty data; attractive formatting; bad habits to avoid
Who This Book Is For
Programmers/data scientists with at least some prior exposure to R.
About the Author
William A. Yarberry, Jr., CPA, CISA, is principal consultant, ICCM Consulting LLC, based in Houston, Texas. His practice is focused on IT governance, Sarbanes-Oxley compliance, security consulting, and business analytics for cost management. He was previously a senior manager with PricewaterhouseCoopers, responsible for telecom and network services in the Southwest region. Yarberry has more than 30 years’ experience in a variety of IT-related services, including application development, internal audit management, outsourcing administration, and Sarbanes-Oxley consulting.
His books include The Effective CIO (co-authored), Computer Telephony Integration, $250K Consulting, DPLYR, 50,000 Random Numbers, Telecommunications Cost Management, and GDPR: A Short Primer. In addition, he has written over 20 professional articles on topics ranging from wireless security to change management. One of his articles, "Audit Rights in an Outsource Environment," received the Institute of Internal Auditors Outstanding Contributor Award.
Prior to joining PricewaterhouseCoopers, Yarberry was director of telephony services for Enron Corporation. He was responsible for operations, planning, and architectural design for voice communications servers and related systems for more than 7,000 employees. Yarberry graduated Phi Beta Kappa in chemistry from the University of Tennessee and earned an MBA at the University of Memphis. He enjoys reading history, swimming, hiking, and spending time with family.
In this Book
-
Dplyr
-
Stringr
-
Lubridate: Date and Time Processing
-
Regular Expressions: Introduction
-
Typical Uses
-
Some Simple Patterns
-
Character Classes
-
Elements of Regular Expressions
-
The Magnificent Seven
-
Regular Expressions in Stringr
-
Unicode
-
Tools for Development and Resources
-
RegEx Summary
-
Recipes for Common R Tasks
-
Data Structures
-
Visualization
-
Simple Prediction Methods
-
Smorgasbord of Simple Statistical Tests
-
Validation of Data
-
Shortcuts and Miscellaneous
-
Conclusion