Alerting as Code: How Mistral AI Uses Terraform as the Source of Truth
A Terraform-first model for deterministic alerting in AI systems
.png)


The Suez Canal has been big news over the last couple of weeks. We wondered how a Site Reliability Engineer (SRE) might conduct a postmortem on what happened with the Ever Given, and what that might mean if a comparable incident occurred at a modern tech company.

JJ Tang

The Suez Canal has been big news over the last couple of weeks. We wondered how a Site Reliability Engineer (SRE) might conduct a postmortem on what happened with the Ever Given, and what that might mean if a comparable incident occurred at a modern tech company.

JJ Tang


By adding new complexity to reliability engineering and making physical war rooms a thing of the past, COVID-19 has imposed permanent changes on incident management. Here’s how SREs can respond.

Quentin Rousseau
By adding new complexity to reliability engineering and making physical war rooms a thing of the past, COVID-19 has imposed permanent changes on incident management. Here’s how SREs can respond.

Quentin Rousseau


Successful and blameless postmortems can turn incidents into a gift of learning and prevent repeat mistakes.

Camille Hodoul

Successful and blameless postmortems can turn incidents into a gift of learning and prevent repeat mistakes.

Camille Hodoul
