Christian Battaglia
December 20, 2019
2 min read
A sample:
IncidentBotAPP 1:32 PM
Issue: Science assessments Audio option is not appearing
Latest Update: Error rates have declined after removing possibly faulty server instances. We do not know the root cause, but users should now be able to access audio in assessments.
Status: Monitoring
Severity: SEV-3 (what's this?)
Affected products: Amplify Science
Response channel: #serious-badger-incident
Fact sheet: :clipboard: Fact Sheet (previous updates, etc.)
Definition:
Incidents are any interruption in normal service to our customers or employees.
Our Incident Management process is how we organize our response and communications when an incident occurs.
We use severity levels to indicate the scope of an incident, and to help guide the immediacy and scope of our response.
Please note that severity levels are not a popularity contest – we don't need to call something a SEV-1 just because we care about a particular customer or product a lot. They're just a guideline to clarify what is happening and how we are responding.
The first goal of the incident management process is to restore a normal service operation as quickly as possible and to minimize the impact on business operations, thus ensuring that the best possible levels of service quality and availability are maintained.
In addition to the primary goal, there are a series of lesser goals that would ideally be achieved: