Data Lake Architectures & Data Management Principles
Big Data
| Intermediate
- 10 videos | 34m 8s
- Includes Assessment
- Earns a Badge
A key component to wrangling data is the data lake framework. In this 9-video Skillsoft Aspire course, learners discover how to implement data lakes for real-time management. Explore data ingestion, data processing, and data lifecycle management with Amazon Web Services (AWS) and other open-source ecosystem products. Begin by examining real-time big data architectures, and how to implement Lambda and Kappa architectures to manage real-time big data. View benefits of adopting Zaloni data lake reference architecture. Examine the essential approach of data ingestion and comparative benefits provided by file formats Avro and Parquet. Explore data ingestion with Sqoop, and various data processing strategies provided by MapReduce V2, Hive, Pig, and Yam for processing data with data lakes. Learn how to derive value from data lakes and describe benefits of critical roles. Learners will explore steps involved in the data lifecycle and the significance of archival policies. Finally, learn how to implement an archival policy to transition between S3 and Glacier, depending on adopted policies. Close the course with an exercise on ingesting data and archival policy.
WHAT YOU WILL LEARN
-
Implement lambda and kappa architectures to manage real-time big dataIdentify the benefits of adopting zaloni data lake reference architectureDescribe data ingestion approaches and compare avro and parquet file format benefitsDemonstrate how to ingest data using sqoopDescribe the data processing strategies provided by mapreduce v2, hive, pig, and yam for processing data with data lakes
-
Recognize how to derive value from data lakes and describe the benefits of critical rolesDescribe the steps involved in the data life cycle and the significance of archival policiesImplement an archival policy to transition between s3 and glacier, depending on adopted policiesIngest data using sqoop and implement an archival policy to transition from s3 to adopted policies
IN THIS COURSE
-
2m 9s
-
4m 5sFind out how to implement Lambda and Kappa architectures to manage real-time big data. FREE ACCESS
-
2m 11sIn this video, find out how to identify the benefits of adopting Zaloni's data lake reference architecture. FREE ACCESS
-
4m 44sAfter completing this video, you will be able to describe data ingestion approaches and compare the benefits of Avro and Parquet file formats. FREE ACCESS
-
5m 55sIn this video, you will learn how to ingest data using Sqoop. FREE ACCESS
-
3m 42sUpon completion of this video, you will be able to describe the data processing strategies provided by MapReduce V2, Hive, Pig, and Yarn for processing data with data lakes. FREE ACCESS
-
2m 32sUpon completion of this video, you will be able to recognize how to derive value from data lakes and describe the benefits of critical roles. FREE ACCESS
-
2m 28sUpon completion of this video, you will be able to describe the steps involved in the data life cycle and the significance of archival policies. FREE ACCESS
-
4m 4sIn this video, learn how to implement an archival policy to transition between S3 and Glacier, depending on the policies you have adopted. FREE ACCESS
-
2m 19sIn this video, find out how to ingest data using Sqoop and implement an archival policy to transition from S3 to Glacier. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.