Guerrilla Analytics: A Practical Approach to Working with Data

  • 4h 59m
  • Enda Ridge
  • Elsevier Science and Technology Books, Inc.
  • 2015

Doing data science is difficult. Projects are typically very dynamic with requirements that change as data understanding grows. The data itself arrives piecemeal, is added to, replaced, contains undiscovered flaws and comes from a variety of sources. Teams also have mixed skill sets and tooling is often limited. Despite these disruptions, a data science team must get off the ground fast and begin demonstrating value with traceable, tested work products. This is when you need Guerrilla Analytics.

In this book, you will learn about:

  • The Guerrilla Analytics Principles: simple rules of thumb for maintaining data provenance across the entire analytics life cycle from data extraction, through analysis to reporting.
  • Reproducible, traceable analytics: how to design and implement work products that are reproducible, testable and stand up to external scrutiny.
  • Practice tips and war stories: 90 practice tips and 16 war stories based on real-world project challenges encountered in consulting, pre-sales and research.
  • Preparing for battle: how to set up your team's analytics environment in terms of tooling, skill sets, workflows and conventions.
  • Data gymnastics: over a dozen analytics patterns that your team will encounter again and again in projects.

About the Author

Enda Ridge is an accomplished data scientist whose experience spans consulting, pre-sales of analytics software and research in academia.

He has consulted to clients in the public and private sectors including financial services, insurance, audit and IT security. Enda is an expert in agile analytics for real world projects where data and requirements change often, resources and tooling are sometimes very limited and results must be traceable and auditable for high profile stakeholders. His experience includes analytics to support the forensic investigation of a major US bankruptcy and the remediation a UK bank's mis-selling of financial products. He has also applied machine learning and NoSQL approaches to problems in document classification, surveillance and IT access controls. His PhD used Design of Experiments techniques to methodically evaluate algorithm performance.

Enda has authored or co-authored 12 academic research papers, is an invited contributor to edited books and has spoken at several analytics practitioner conferences.

Enda holds a Bachelor's degree in Mechanical Engineering and Master's in Applied Computing from the National University of Ireland at Galway and was awarded the National University of Ireland's Travelling Studentship in Engineering. His PhD was awarded by the University of York, UK.

In this Book

  • Introducing Guerrilla Analytics
  • Guerrilla Analytics—Challenges and Risks
  • Guerrilla Analytics Principles
  • Stage 1—Data Extraction
  • Stage 2—Data Receipt
  • Stage 3—Data Load
  • Stage 4—Analytics Coding for Ease of Review
  • Stage 4—Analytics Coding to Maintain Data Provenance
  • Stage 6—Creating Work Products
  • Stage 7—Reporting
  • Stage 5—Consolidating Knowledge in Builds
  • Introduction to Testing
  • Testing Data
  • Testing Builds
  • Testing Work Products
  • People
  • Process
  • Technology
  • Closing Remarks
  • References
SHOW MORE
FREE ACCESS

YOU MIGHT ALSO LIKE