Data Engineering on Microsoft Azure: Physical Data Storage Structures
Azure
| Intermediate
- 11 videos | 1h 4m 43s
- Includes Assessment
- Earns a Badge
An effective storage structure is critical to big data implementation success. In this course, you'll explore data compression in databases and file storage. Then, you'll discover how partitioning and sharding are implemented in the database. Next, you'll explore designing tables in an Azure Synapse Analytics dedicated SQL pool, and implement geo-replication for redundancy in both databases and Azure Blob storage. You'll also discover implementing distribution schemes in Azure Synapse Analytics. Finally, you'll discover data archiving and long-term retention policies for Azure Blob storage and Azure SQL Databases. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseDefine considerations for implementing data compression technologies at the database and file levelCreate a data partition in a sql databaseDescribe how a shard map manager is used by an application to connect to the required azure sql databasesDescribe the key concepts for designing tables in an azure synapse analytics dedicated sql poolDeploy azure sql database geo-replication
-
Describe the options for redundancy in azure blob storageDetermine the appropriate distribution scheme for an azure synapse analytics database and build it into the table on creationArchive data in azure storage and rehydrate it when necessaryConfigure a long-term retention policy for an azure sql databaseSummarize the key concepts covered in this course
IN THIS COURSE
-
1m 25sIn this video, you’ll learn more about your instructor and this course. In this course, you’ll learn about data compression in databases and file storage. You’ll see how partitioning and sharding are implemented in the database. Next, you'll explore designing Tables in an Azure Synapse Analytics dedicated SQL Pool and implement geo-replication for redundancy in both databases and Azure blob storage. You’ll also learn about implementing distribution schemes in Azure Synapse Analytics. FREE ACCESS
-
4m 46sIn this video, you’ll learn more about how Data Compression can be enabled for SQL Server, Azure SQL database and Azure SQL managed instance. You’ll see database compression improves the performance of input-output workloads and data is stored in less pages, which increases the speed of queries. Row and page compression can be used for row tables and indexes to reduce the size of the database. You’ll also learn about Columnstore compression. FREE ACCESS
-
13m 18sIn this video, you’ll watch a demo. You’ll learn how to create a table partition in a Microsoft Azure SQL Database. You’ll learn partitioning and sharding serve the same goal and can scale similarly, but sharding gives you the additional option of geo-locating shards. You can create up to 15,000 partitions so table partitioning can handle a high degree of scale. In this demo, you’ll create two partitions, operational and historical. FREE ACCESS
-
3m 48sIn this video, you’ll learn about sharding data. Sharding is a method for distributing large amounts of identically structured data across multiple independent databases. There are many reasons to shard data. The amount of data may be too large to fit within a single database due to the database limitations. Or, the number of transactions that will be sent to the database may exceed the number of transactions that can be handled by the database. FREE ACCESS
-
7m 38sIn this video, you’ll learn about designing tables. You’ll learn designing a database structure is important for storing the correct data needed for future data analysis. It also allows the data to be queried effectively and efficiently. Getting the proper database structure right in the beginning will save you a lot of time, effort, and money in the long run. FREE ACCESS
-
8m 40sIn this video, you’ll watch a demo. You’ll learn how to implement Geo-redundancy with your Azure SQL Database. To start, you’ll go to the Azure console and choose SQL databases. You should be able to find your previously created SQL database. Right now, it's just a basic database. There’s no Replica type. You’ll start the Geo-Replication process now. FREE ACCESS
-
6m 52sIn this video, you’ll learn about redundancy. In particular, you’ll learn about file and blob data redundancy. File and blob data redundancy ensures your storage account achieves its availability and durability targets for both planned and unplanned events. This could include hardware failures, network outages, power outages, and massive natural disasters. An Azure storage account replicates data three times in the primary region. FREE ACCESS
-
6m 56sIn this video, you’ll watch a demo. In this demo, you’ll determine the appropriate Distribution Scheme for an Azure Synapse Analytics database and build it into the table upon creation. You’ll first go into your Azure console. Then, you’ll head to Azure Synapse Analytics. Click Azure Synapse Analytics. You'll see your workspace is already created. FREE ACCESS
-
5m 43sIn this video, you’ll watch a demo. In this demo, you’ll learn how to archive a blob storage and then rehydrate that storage as needed. The first thing you’ll do is head to your Storage accounts and click on Storage accounts. You’ll find your specific Storage account. It's a StorageV2. Then, you’ll head to Containers. Containers are where you’ll upload your file, archive it, and then rehydrate it. FREE ACCESS
-
4m 38sIn this video, you’ll watch a demo. In this demo, you’ll learn how to set up Data Retention policies within your Azure SQL Server database. First, you’ll go to the Azure console. Then, you’ll find the SQL Server you want to create your Data Retention policy for. You’ll configure data retention on this replicated database. You’ll need to click the database you want to configure. Then, you'll see Backups under Settings, and click Backups. FREE ACCESS
-
59sIn this video, you’ll summarize what you’ve learned in this course. You’ve learned how to plan and implement an efficient Physical Data Storage Structure. You explored considerations for implementing data compression technologies, partitioning and sharing data, and designing tables in an Azure Synapse Analytics. You also looked at database and file/blob data redundancy, implementing distribution schemes and data archiving, and configuring long-term retention policies for an Azure SQL database. FREE ACCESS
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.