Improving Continuously
Lecture
Shane Davis
Beginner
1 h 1 m
2020-08-10
Lecture Overview
This course is designed for IT leaders, developers, and operations folks who have fully embraced building a mature DevOps organization. We start off by digging into what the phrase “failing fast” means along with some common misconceptions. Then, we will look at how to prepare for incidents and service outages. We’ll walk through how to ensure that your processes reduce the possibility that humans will be the cause of an incident, how to better prepare your on-call team for incident response, and what to do when an incident strikes. This course will wrap up by looking at the contributing factors of failure and going beyond root cause analysis, the phases of an incident or outage, and the way to run a post-incident review.

Related Learning Path(s):
DevOps for the Enterprise
Objectives
  • This course outlines the steps to increase the maturity of your DevOps organization.
  • You’ll learn how to embrace a growth mindset along with an approach to reviewing incidents as a team.
  • You’ll also learn how to reduce the possibility that humans will be the cause of an incident, how to better prepare your on-call team for incident response, and what to do when an incident strikes.
  • This course will go beyond root cause analysis and cover the phases of an incident or outage, and provide guidance on the way to run a post-incident review.
Pre-Requisites
  • Basic knowledge of the Agile methodology
  • Basic knowledge of the software development life cycle
  • Foundational understanding of DevOps principles
Lecture Modules
Module 1 digs into the meaning of the phrase “failing fast” and some common misconceptions about it. We’ll also look at how to embrace a growth mindset along with an approach to reviewing incidents as a team.
Module 2 looks at how to prepare for incidents and service outages. We’ll also look at how to reduce the possibility that humans will be the cause of an incident, how to better prepare your on-call team for incident response, and what to do when an incident strikes.
Module 3 is about going beyond root cause analysis, the phases of an incident or outage, and the way to run a post-incident review.

Skill Me Up subscriptions include unlimited access to on-demand courses, with live lab environments available through our Real Time Labs feature for hands-on lab access.

Subscription Benefits
  • Access to Real Time Lab environments and lab guides
  • Course Completion Certificates when you pass assessments
  • MUCH MORE!