Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing

4h 30m
Tirthajyoti Sarkar
Apress
2022

This book focuses on the Python-based tools and techniques to help you become highly productive at all aspects of typical data science stacks such as statistical analysis, visualization, model selection, and feature engineering.

You’ll review the inefficiencies and bottlenecks lurking in the daily business process and solve them with practical solutions. Automation of repetitive data science tasks is a key mindset that is promoted throughout the book. You’ll learn how to extend the existing coding practice to handle larger datasets with high efficiency with the help of advanced libraries and packages that already exist in the Python ecosystem.

The book focuses on topics such as how to measure the memory footprint and execution speed of machine learning models, quality test a data science pipelines, and modularizing a data science pipeline for app development. You’ll review Python libraries which come in very handy for automating and speeding up the day-to-day tasks.

In the end, you’ll understand and perform data science and machine learning tasks beyond the traditional methods and utilize the full spectrum of the Python data science ecosystem to increase productivity.

You will:

Write fast and efficient code for data science and machine learning
Build robust and expressive data science pipelines
Measure memory and CPU profile for machine learning methods
Utilize the full potential of GPU for data science tasks
Handle large and complex data sets efficiently

About the Author

Dr. Tirthajyoti Sarkar lives in the San Francisco Bay area works as a Data Science and Solutions Engineering Manager at Adapdix Corp., where he architects Artificial intelligence and Machine learning solutions for edge-computing based systems powering the Industry 4.0 and Smart manufacturing revolution across a wide range of industries. Before that, he spent more than a decade developing best-in-class semiconductor technologies for power electronics.

He has published data science books, and regularly contributes highly cited AI/ML-related articles on top platforms such as KDNuggets and Towards Data Science. Tirthajyoti has developed multiple open-source software packages in the field of statistical modeling and data analytics. He has 5 US patents and more than thirty technical publications in international journals and conferences.

He conducts regular workshops and participates in expert panels on various AI/ML topics and contributes to the broader data science community in numerous ways. Tirthajyoti holds a Ph.D. from the University of Illinois and a B.Tech degree from the Indian Institute of Technology, Kharagpur.

In this Book

Introduction
What is Productive and Efficient Data Science?
Better Programming Principles for Efficient Data Science
How to Use Python Data Science Packages More Productively
Writing Machine Learning Code More Productively
Modular and Productive Deep Learning Code
Build Your Own ML Estimator/Package
Some Cool Utility Packages
Memory and Timing Profile
Scalable Data Science
Parallelized Data Science
GPU-Based Data Science for High Productivity
Other Useful Skills to Master
Wrapping it up

FREE ACCESS

Book Fast Python: High Performance Techniques For Large Datasets

Journey Prompt Engineering for Programmers to Learn Python

(1)

Book Mastering Search Algorithms with Python: A Practical Guide for Efficient Data Search

Get Started

Sharpen your skills. Upgrade your career. Find the right learning path for you, based on your role and skills. Take part in hands-on practice, study for a certification, and much more - all personalized for you.

*Not included: Compliance, Leadership Development Program content, and Engineering books

Your content + our content + our platform = a path to learning success

Using our learning experience platform, Percipio, your learners can engage in custom learning paths that can feature curated content from all sources.

Learn More

Aspire to something bigger

Aspire Journeys are guided learning paths that set you in motion for career success.

Browse Aspire Journeys

Explore a world of live learning with Global Knowledge

Choose from convenient delivery formats to get the training you and your team need - where, when and how you want it.

Browse Live Learning

IT Skills and Salary Report

ESG Impact Report

Productive and Efficient Data Science with Python: With Modularizing, Memory profiles, and Parallel/GPU Processing

In this Book

YOU MIGHT ALSO LIKE