SRE Metric Management: Software Reliability Monitoring and Reporting
SRE
| Intermediate
- 17 videos | 1h 17m 47s
- Includes Assessment
- Earns a Badge
Once SRE metrics have been identified, site reliability engineers (SREs) must know how to perform fault analysis on a system, classify defects, and monitor and report data. In this course, you'll explore the tools and best practices for carrying out these procedures. You'll begin by identifying various fault analysis methods and tools. You'll then classify software defects and bugs with a focus on severity and priority. Next, you'll investigate strategies for monitoring APIs and explore some tools used for this task. You'll then examine in detail several tools for collecting, analyzing, and reporting metric data using a customizable dashboard, including those that comprise the ELK Stack - Elasticsearch, Logstash, and Kibana. Furthermore, you'll explore the data collection tool Beats and the beneficial use cases for Elasticsearch notifications.
WHAT YOU WILL LEARN
-
Discover the key concepts covered in this courseOutline various methods for analyzing the effects of faults in a systemOutline how to use fault tree analysis to determine the cause of faults in a systemName the tools that can be used to perform fault tree analysisOutline how to classify software defectsDescribe the various types of software bugs and recognize why they occurDifferentiate between the severity and priority of software bugsOutline best practice when defining api monitoring strategiesState the key characteristics of api monitoring strategies
-
List api monitoring tools and their strengths and weaknessesIdentify the components of the elk stack and how they work together for data reportingDescribe the features and benefits of elasticsearch for storing log dataDescribe the features and benefits of kibana for viewing dataDescribe the features and benefits of beats for data collectionDescribe the features and benefits of logstash for data processingOutline how to use elasticsearch notifications to notify staff when api services have issuesSummarize the key concepts covered in this course
IN THIS COURSE
-
1m 26s
-
5m 32s
-
6m 11s
-
2m 38s
-
4m 25s
-
4m 50s
-
4m 37s
-
6m 30s
-
5m 43s
-
4m 54s
-
4m 33s
-
5m 10s
-
3m 59s
-
5m 43s
-
5m 33s
-
5m 1s
-
1m 2s
EARN A DIGITAL BADGE WHEN YOU COMPLETE THIS COURSE
Skillsoft is providing you the opportunity to earn a digital badge upon successful completion on some of our courses, which can be shared on any social network or business platform.
Digital badges are yours to keep, forever.