Data Engineering on Microsoft Azure: Data Partitioning
Azure
| Intermediate
- 11 videos | 1h 2m 57s
- Includes Assessment
- Earns a Badge
Partitioning data is key to ensuring efficient processing. In this course, you'll explore what data partitioning is and the strategies for implementation. You'll learn about transactional and analytical workloads and how to determine the best strategy for your files and table storage. Then, you'll examine design patterns for efficiency and performance. You'll learn about partitioning dedicated SQL pools in Azure Synapse Analytics and partitioning data lakes. Finally, you'll learn how data sharding across multiple data stores can be used for improving transaction performance. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseRecognize data partitioning conceptsDescribe data partitioning strategies for different servicesCompare transactional and analytical workloads to determine which data store and partitioning strategy to implementRecognize criteria for determining how to partition files for efficient distribution and queryingDescribe how to partition a table to ensure efficient scalability for analytical workloads
-
Describe how the index table and materialized view design patterns can increase efficiency and performance of queriesDescribe table partitions used by azure synapse analytics and how to size them, and recognize the differences from sql serverDescribe when to implement partitioning at the storage layerDescribe how data sharding distributes load over multiple datastoresSummarize the key concepts covered in this course
IN THIS COURSE
-
1m 33sExplore what data partitioning is. You’ll define strategies for implementation. You’ll also cover transactional and analytical workloads. You’ll determine the best strategy for your files and table storage. Then you'll explore design patterns for efficiency and performance. Next, you'll explore partitioning, dedicated SQL Pools in Azure Synapse analytics and partitioning data links. FREE ACCESS
-
6m 53sHere, you’ll learn more about partitioning data. You’ll learn all about the physical partitioning of data and not the virtual partitioning within the same data table. You’ll learn the purpose of partitioning data. You’ll also learn about functional partitioning and what it's used for. Partitioning improves availability and it can improve security by separating sensitive data from non-sensitive data. FREE ACCESS
-
6m 40sTake a look at the data partitioning strategies for several Azure products. Explore SQL database, Cosmos database, Table storage, Azure search, and Azure service bus. Learn how to implement an elastic pool to achieve high horizontal scalability. An elastic pool breaks data into shards that can be stored across multiple SQL database instances. FREE ACCESS
-
6m 35sHere, you’ll look at datastore and partitioning strategies. You’ll look at how different workloads are served by these different strategies and how data is stored. You’ll learn about transactional workloads. The majority of common application operations fall under this category. You’ll also learn more about analytical workloads for finding useful patterns in data. FREE ACCESS
-
6m 5sDiscover Apache Spark, popular for processing large datasets. An Apache Hive is an equally popular data warehouse. These two technologies are often combined or data processing may take place in Spark and the data is then written to Hive. Here, you’ll look at some file partition strategies for using a data pipeline with Apache Spark and Apache Hive. FREE ACCESS
-
7m 12sHere, you’ll look at partitioning strategies for Azure Table Storage. You’ll discuss table entities that are stored in a table. You’ll learn about table partitioning, partition sizing, and the pros and cons of specific partition sizing strategies. You’ll also learn to perform a table partition stress test. Table entities in Azure Table storage are analogous to table rows. FREE ACCESS
-
8m 12sHere, you’ll look at designing datastores for efficiency and performance. You’ll learn about the index table pattern, which can optimize query performance by making data easier to find. You’ll also look at the materialized view pattern which represents data in a way that's more consumable to a query than the way it's stored in the data schema. FREE ACCESS
-
4m 50sHere, you’ll look at partitioning Azure Synapse Analytics Dedicated Pools. You’ll learn the advantages such partitioning holds for queries. You’ll also look at the advantages of partitioning when loading data. You’ll cover considerations when sizing partitions. Finally, you'll look at the details of partition splitting and switching. FREE ACCESS
-
7mDiscover partitioning of storage in Data Lakes. Explore Azure Table Storage, which allows you to store entities and query them by a unique key. Learn about Azure Blob Storage, which allows you to store any kind of document as a blob, be it structured, semi-structured, or non-structured data. Explore Azure Queue Storage, which allows for message handling. FREE ACCESS
-
7m 1sHere, you’ll investigate data sharding for scaling. You’ll discuss the limitations of not sharding, which means storing all of your data on a single physical server. Next, you'll discuss sharding and how it can be beneficial. Then you'll discuss the characteristics of three common sharding strategies. Finally, you'll look at factors to consider when implementing an effective sharding strategy. FREE ACCESS
-
54sYou’ve examined data partitioning strategies and designing for performance. You explored data partitioning and the strategies employed. You looked at transactional and analytical workloads and file partition strategies. You discovered partitioning strategies for table storage and designing for performance efficiency. You explored partitioning in Azure Synapse Analytics. You learned about partitioning data lakes and data sharding for scaling. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.