Blog

Incident management insights, guides, and product updates from Rootly

Search...
No items found.
7 Essential Tools for SREs7 Essential Tools for SREs

7 Essential Tools for SREs

From chaos engineering to monitoring and beyond, SREs rely on several key types of tools to do their jobs.

Quentin Rousseau

Quentin Rousseau

June 25, 2021
5 min read
7 Essential Tools for SREs

7 Essential Tools for SREs

From chaos engineering to monitoring and beyond, SREs rely on several key types of tools to do their jobs.

Quentin Rousseau

Quentin Rousseau

June 25, 2021
5 min read
No items found.
Practical Guide to SRE: Incident Severity LevelsPractical Guide to SRE: Incident Severity Levels

Practical Guide to SRE: Incident Severity Levels

Incident severity levels are a measurement of the impact an incident has on the business. Classifying the severity of an issue is critical to decide how quickly and efficiently problems get resolved.

Nancy Chauhan

Nancy Chauhan

June 17, 2021
4 min read
Practical Guide to SRE: Incident Severity Levels

Practical Guide to SRE: Incident Severity Levels

Incident severity levels are a measurement of the impact an incident has on the business. Classifying the severity of an issue is critical to decide how quickly and efficiently problems get resolved.

Nancy Chauhan

Nancy Chauhan

June 17, 2021
4 min read
No items found.
The Incident Review: 4 Times When Typos Brought Down Critical SystemsThe Incident Review: 4 Times When Typos Brought Down Critical Systems

The Incident Review: 4 Times When Typos Brought Down Critical Systems

Sometimes, as these 4 incidents highlight, major failure results from a mere typo or configuration oversight.

JJ Tang

JJ Tang

June 4, 2021
5 min read
The Incident Review: 4 Times When Typos Brought Down Critical Systems

The Incident Review: 4 Times When Typos Brought Down Critical Systems

Sometimes, as these 4 incidents highlight, major failure results from a mere typo or configuration oversight.

JJ Tang

JJ Tang

June 4, 2021
5 min read
No items found.
Incident Management vs. Incident Response - What's the Difference?Incident Management vs. Incident Response - What's the Difference?

Incident Management vs. Incident Response - What's the Difference?

What are the differences between incident management and incident response? The answer varies widely depending on whom you ask.

Quentin Rousseau

Quentin Rousseau

May 28, 2021
4 min read
Incident Management vs. Incident Response - What's the Difference?

Incident Management vs. Incident Response - What's the Difference?

What are the differences between incident management and incident response? The answer varies widely depending on whom you ask.

Quentin Rousseau

Quentin Rousseau

May 28, 2021
4 min read
No items found.
Practical Guide to SRE: Using SLOs to Increase ReliabilityPractical Guide to SRE: Using SLOs to Increase Reliability

Practical Guide to SRE: Using SLOs to Increase Reliability

Service Level Objectives (SLOs) are a key component of any successful Site Reliability Engineering initiative. The question is, what are SLOs; and how do you determine what your SLOs should be? Once you've done that, how should you use them?

Quentin Rousseau

Quentin Rousseau

May 13, 2021
9 min read
Practical Guide to SRE: Using SLOs to Increase Reliability

Practical Guide to SRE: Using SLOs to Increase Reliability

Service Level Objectives (SLOs) are a key component of any successful Site Reliability Engineering initiative. The question is, what are SLOs; and how do you determine what your SLOs should be? Once you've done that, how should you use them?

Quentin Rousseau

Quentin Rousseau

May 13, 2021
9 min read
No items found.
Practical Guide to SRE: Automating On-CallPractical Guide to SRE: Automating On-Call

Practical Guide to SRE: Automating On-Call

Let's all face it, on call work isn't fun. But it can be better. Even if you have to work on call, it would be nice to have at least some of the work done for you, before you drag yourself out of bed at 3am to respond to an incident.

JJ Tang

JJ Tang

May 6, 2021
8 min read
Practical Guide to SRE: Automating On-Call

Practical Guide to SRE: Automating On-Call

Let's all face it, on call work isn't fun. But it can be better. Even if you have to work on call, it would be nice to have at least some of the work done for you, before you drag yourself out of bed at 3am to respond to an incident.

JJ Tang

JJ Tang

May 6, 2021
8 min read
No items found.
How Kubernetes Can Both Help and Hinder Incident Management TeamsHow Kubernetes Can Both Help and Hinder Incident Management Teams

How Kubernetes Can Both Help and Hinder Incident Management Teams

Kubernetes makes it easier in certain ways to manage reliability. But incident response teams and SREs must also be prepared to handle the unique reliability challenges that K8s creates.

Quentin Rousseau

Quentin Rousseau

April 29, 2021
5 min read
How Kubernetes Can Both Help and Hinder Incident Management Teams

How Kubernetes Can Both Help and Hinder Incident Management Teams

Kubernetes makes it easier in certain ways to manage reliability. But incident response teams and SREs must also be prepared to handle the unique reliability challenges that K8s creates.

Quentin Rousseau

Quentin Rousseau

April 29, 2021
5 min read
No items found.
Creating Chaos to Achieve ReliabilityCreating Chaos to Achieve Reliability

Creating Chaos to Achieve Reliability

How can creating chaos achieve better reliability? Chaos and reliability might seem mutually exclusive, but through the use of Chaos Engineering, SREs can bring about meaningful changes to system resiliency.

JJ Tang

JJ Tang

April 22, 2021
5 min read
Creating Chaos to Achieve Reliability

Creating Chaos to Achieve Reliability

How can creating chaos achieve better reliability? Chaos and reliability might seem mutually exclusive, but through the use of Chaos Engineering, SREs can bring about meaningful changes to system resiliency.

JJ Tang

JJ Tang

April 22, 2021
5 min read
No items found.
Should You Be an SRE or a DevOps Engineer?Should You Be an SRE or a DevOps Engineer?

Should You Be an SRE or a DevOps Engineer?

SREs may have better long-term job prospects, but DevOps might be an easier career to pursue.

Quentin Rousseau

Quentin Rousseau

April 15, 2021
5 min read
Should You Be an SRE or a DevOps Engineer?

Should You Be an SRE or a DevOps Engineer?

SREs may have better long-term job prospects, but DevOps might be an easier career to pursue.

Quentin Rousseau

Quentin Rousseau

April 15, 2021
5 min read