SRE Team Management: Operational Overload

SRE    |    Intermediate
  • 14 videos | 55m 21s
  • Includes Assessment
  • Earns a Badge
Rating 4.6 of 21 users
Site reliability engineers (SREs) are responsible for many administrative tasks, often splitting their time between reactive ops work and special projects. To keep a team from becoming overloaded, an SRE may be transferred to it to help prevent or mitigate the overload. In this course, you will learn how to deal with operational overload. You'll start by examining ops mode, which is an approach used to ensure services are properly maintained and optimized. You'll discover factors that contribute to team morale and stress. In addition, you will outline emergency planning strategies and best practices, as well as learn how to categorize emergencies and prepare detailed emergency plans. Next, you'll explore how knowledge sharing relates to emergency preparedness, the key to writing successful postmortems, the importance of service level objectives, and how an appropriate level of detail is required to properly explain your findings. Lastly, you'll discover the key factors and attributes of successful teams. You'll examine a team-first approach and differentiate between questioning techniques such as open/closed, funnel, probing, and leading.

WHAT YOU WILL LEARN

  • Discover the key concepts covered in this course
  • Describe the term ops mode and differentiate between ops mode and nonlinear scaling
  • Outline factors that contribute to team morale and stress, such as financial and managerial impacts
  • List the details to include in an IT emergency plan
  • Outline possible emergencies to plan for, such as undiagnosed alerts and knowledge gaps
  • Describe how knowledge sharing can help teams plan for emergencies and recover from failures
  • Recognize key factors of a high-quality postmortem
  • Classify team emergencies into either 'toil' or 'not toil' categories
  • Recognize the importance of service level objectives (SLOs) as they relate to a long-term SRE focus
  • Describe steps to ensure a team-first approach to fixing overload issues
  • Outline the importance of properly explaining findings and applying an appropriate level of detail for explanations
  • List key attributes of successful teams, including purpose, trust, and awareness
  • Differentiate between questioning techniques such as open/closed, funnel, probing, and leading
  • Summarize the key concepts covered in this course

IN THIS COURSE

  • 1m 43s
  • 3m 46s
    Upon completion of this video, you will be able to describe the term "ops mode" and differentiate between ops mode and nonlinear scaling.
  • 3.  Operational Stress
    2m 58s
    In this video, find out how to outline factors that contribute to team morale and stress, such as financial and managerial impacts.
  • 4.  IT Emergency Planning
    4m 59s
    Upon completion of this video, you will be able to list the details that should be included in an IT emergency plan.
  • 5.  Impending Emergencies
    5m 34s
    Learn how to outline possible emergencies to plan for, such as undiagnosed alerts and knowledge gaps.
  • 6.  Knowledge Sharing
    3m 34s
    After completing this video, you will be able to describe how knowledge sharing can help teams plan for emergencies and recover from failures.
  • 7.  Blameless Postmortems
    4m 46s
    Upon completion of this video, you will be able to recognize key factors of a high-quality postmortem.
  • 8.  Categorizing Emergencies
    4m 11s
    In this video, you will classify team emergencies into either 'toil' or 'not toil' categories.
  • 9.  Choosing an Appropriate SLO
    4m 5s
    Upon completion of this video, you will be able to recognize the importance of service level objectives (SLOs) as they relate to a long-term focus on SRE.
  • 10.  Effective Teams
    3m 37s
    Upon completion of this video, you will be able to describe steps to ensure a team-first approach to fixing overload issues.
  • 11.  Findings and Reasonings
    4m 29s
    Learn how to outline the importance of properly explaining findings and applying an appropriate level of detail for explanations.
  • 12.  Attributes of Successful Teams
    4m 46s
    Upon completion of this video, you will be able to list key attributes of successful teams, including purpose, trust, and awareness.
  • 13.  Questioning Techniques
    5m 49s
    In this video, you will learn how to differentiate between questioning techniques such as open/closed, funnel, probing, and leading.
  • 14.  Course Summary
    1m 7s

EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE

Skillsoft offers you the opportunity to earn a digital badge upon successful completion of some of its courses. Badges can be shared on any social network or business platform.

Digital badges are yours to keep, forever.
