Data Engineering on Microsoft Azure: Physical Data Storage Structures

Azure    |    Intermediate
  • 11 videos | 1h 4m 43s
  • Includes Assessment
  • Earns a Badge
Rating 4.7 of 43 users Rating 4.7 of 43 users (43)
An effective storage structure is critical to big data implementation success. In this course, you'll explore data compression in databases and file storage. Then, you'll discover how partitioning and sharding are implemented in the database. Next, you'll explore designing tables in an Azure Synapse Analytics dedicated SQL pool, and implement geo-replication for redundancy in both databases and Azure Blob storage. You'll also discover implementing distribution schemes in Azure Synapse Analytics. Finally, you'll discover data archiving and long-term retention policies for Azure Blob storage and Azure SQL Databases. This course is one in a collection that prepares learners for the Microsoft Data Engineering on Microsoft Azure (DP-203) exam.

WHAT YOU WILL LEARN

  • Discover the key concepts covered in this course
    Define considerations for implementing data compression technologies at the database and file level
    Create a data partition in a sql database
    Describe how a shard map manager is used by an application to connect to the required azure sql databases
    Describe the key concepts for designing tables in an azure synapse analytics dedicated sql pool
    Deploy azure sql database geo-replication
  • Describe the options for redundancy in azure blob storage
    Determine the appropriate distribution scheme for an azure synapse analytics database and build it into the table on creation
    Archive data in azure storage and rehydrate it when necessary
    Configure a long-term retention policy for an azure sql database
    Summarize the key concepts covered in this course

IN THIS COURSE

  • 1m 25s
    In this video, you’ll learn more about your instructor and this course. In this course, you’ll learn about data compression in databases and file storage. You’ll see how partitioning and sharding are implemented in the database. Next, you'll explore designing Tables in an Azure Synapse Analytics dedicated SQL Pool and implement geo-replication for redundancy in both databases and Azure blob storage. You’ll also learn about implementing distribution schemes in Azure Synapse Analytics. FREE ACCESS
  • 4m 46s
    In this video, you’ll learn more about how Data Compression can be enabled for SQL Server, Azure SQL database and Azure SQL managed instance. You’ll see database compression improves the performance of input-output workloads and data is stored in less pages, which increases the speed of queries. Row and page compression can be used for row tables and indexes to reduce the size of the database. You’ll also learn about Columnstore compression. FREE ACCESS
  • Locked
    3.  Partitioning Data
    13m 18s
    In this video, you’ll watch a demo. You’ll learn how to create a table partition in a Microsoft Azure SQL Database. You’ll learn partitioning and sharding serve the same goal and can scale similarly, but sharding gives you the additional option of geo-locating shards. You can create up to 15,000 partitions so table partitioning can handle a high degree of scale. In this demo, you’ll create two partitions, operational and historical. FREE ACCESS
  • Locked
    4.  Sharding Data
    3m 48s
    In this video, you’ll learn about sharding data. Sharding is a method for distributing large amounts of identically structured data across multiple independent databases. There are many reasons to shard data. The amount of data may be too large to fit within a single database due to the database limitations. Or, the number of transactions that will be sent to the database may exceed the number of transactions that can be handled by the database. FREE ACCESS
  • Locked
    5.  Designing Tables
    7m 38s
    In this video, you’ll learn about designing tables. You’ll learn designing a database structure is important for storing the correct data needed for future data analysis. It also allows the data to be queried effectively and efficiently. Getting the proper database structure right in the beginning will save you a lot of time, effort, and money in the long run. FREE ACCESS
  • Locked
    6.  Ensuring Database Data Redundancy
    8m 40s
    In this video, you’ll watch a demo. You’ll learn how to implement Geo-redundancy with your Azure SQL Database. To start, you’ll go to the Azure console and choose SQL databases. You should be able to find your previously created SQL database. Right now, it's just a basic database. There’s no Replica type. You’ll start the Geo-Replication process now. FREE ACCESS
  • Locked
    7.  File and Blob Data Redundancy
    6m 52s
    In this video, you’ll learn about redundancy. In particular, you’ll learn about file and blob data redundancy. File and blob data redundancy ensures your storage account achieves its availability and durability targets for both planned and unplanned events. This could include hardware failures, network outages, power outages, and massive natural disasters. An Azure storage account replicates data three times in the primary region. FREE ACCESS
  • Locked
    8.  Implementing Distribution Schemes
    6m 56s
    In this video, you’ll watch a demo. In this demo, you’ll determine the appropriate Distribution Scheme for an Azure Synapse Analytics database and build it into the table upon creation. You’ll first go into your Azure console. Then, you’ll head to Azure Synapse Analytics. Click Azure Synapse Analytics. You'll see your workspace is already created. FREE ACCESS
  • Locked
    9.  Implementing Data Archiving
    5m 43s
    In this video, you’ll watch a demo. In this demo, you’ll learn how to archive a blob storage and then rehydrate that storage as needed. The first thing you’ll do is head to your Storage accounts and click on Storage accounts. You’ll find your specific Storage account. It's a StorageV2. Then, you’ll head to Containers. Containers are where you’ll upload your file, archive it, and then rehydrate it. FREE ACCESS
  • Locked
    10.  Configuring Database Long-term Retention
    4m 38s
    In this video, you’ll watch a demo. In this demo, you’ll learn how to set up Data Retention policies within your Azure SQL Server database. First, you’ll go to the Azure console. Then, you’ll find the SQL Server you want to create your Data Retention policy for. You’ll configure data retention on this replicated database. You’ll need to click the database you want to configure. Then, you'll see Backups under Settings, and click Backups. FREE ACCESS
  • Locked
    11.  Course Summary
    59s
    In this video, you’ll summarize what you’ve learned in this course. You’ve learned how to plan and implement an efficient Physical Data Storage Structure. You explored considerations for implementing data compression technologies, partitioning and sharing data, and designing tables in an Azure Synapse Analytics. You also looked at database and file/blob data redundancy, implementing distribution schemes and data archiving, and configuring long-term retention policies for an Azure SQL database. FREE ACCESS

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.

Digital badges are yours to keep, forever.

YOU MIGHT ALSO LIKE

Rating 4.6 of 34 users Rating 4.6 of 34 users (34)
Rating 4.6 of 5 users Rating 4.6 of 5 users (5)
Rating 4.6 of 161 users Rating 4.6 of 161 users (161)

PEOPLE WHO VIEWED THIS ALSO VIEWED THESE

Rating 4.3 of 63 users Rating 4.3 of 63 users (63)
Rating 4.6 of 28 users Rating 4.6 of 28 users (28)
Rating 4.6 of 189 users Rating 4.6 of 189 users (189)