High-Performance Big Data Computing
- 5h 10m
- Dhabaleswar K. Panda, Dipti Shankar, Xiaoyi Lu
- The MIT Press
- 2022
An in-depth overview of an emerging field that brings together high-performance computing, big data processing, and deep learning.
Over the last decade, the exponential explosion of data known as big data has changed the way we understand and harness the power of data. The emerging field of high-performance big data computing, which brings together high-performance computing (HPC), big data processing, and deep learning, aims to meet the challenges posed by large-scale data processing. This book offers an in-depth overview of high-performance big data computing and the associated technical issues, approaches, and solutions.
The book covers basic concepts and necessary background knowledge, including data processing frameworks, storage systems, and hardware capabilities; offers a detailed discussion of technical issues in accelerating big data computing in terms of computation, communication, memory and storage, codesign, workload characterization and benchmarking, and system deployment and management; and surveys benchmarks and workloads for evaluating big data middleware systems. It presents a detailed discussion of big data computing systems and applications with high-performance networking, computing, and storage technologies, including state-of-the-art designs for data processing and storage systems. Finally, the book considers some advanced research topics in high-performance big data computing, including designing high-performance deep learning over big data (DLoBD) stacks and HPC cloud technologies.
About the Author
Dhabaleswar K. Panda is Professor and University Distinguished Scholar of Computer Science and Engineering at the Ohio State University.
Xiaoyi Lu is an Assistant Professor in the Department of Computer Science and Engineering at the University of California, Merced.
Dipti Shankar is currently working at SAP in Germany.
In this Book
-
Introduction
-
Parallel Programming Models and Systems
-
Parallel and Distributed Storage Systems
-
HPC Architectures and Trends
-
Opportunities and Challenges in Accelerating Big Data Computing
-
Benchmarking Big Data Systems
-
Accelerations with RDMA
-
Accelerations with Multicore/Accelerator Technologies
-
Accelerations with High-Performance Storage Technologies
-
Deep Learning over Big Data
-
Designs with Cloud Technologies
-
Frontier Research on High-Performance Big Data Computing
-
References