Download PNG
Download SVG
SLA driven follow-up tasks.
Why we got rid of our small-PR rule
Small PRs are great for humans, but not for AIs. Instead of focusing on diff size, we moved ot a risk-based model.
Quentin Rousseau
AI's failure mode has changed. Most incidents aren't code bugs: they're context bugs. And faster review can't catch them. A case for rollback.
When everyone can ship clean code with AI, the transcript of how they got there becomes a key signal.
TDX was buzzing with energy as Salesforce unveiled Agentforce 2.0, bringing agentic AI directly into Slack.
What SREs can learn from the CircleCI security incident of January 2023.
Best practices for “SRE pioneers” – meaning engineers who are the very first SREs hired at an organization.
A comparison of the two main SRE team models: Embedded SREs vs. standalone SRE teams.
A list of the top nine SRE skills, from incident management, to cloud computing, to networking and beyond.
An overview of major IT incidents and outages in 2021
An overview of how SREs can benefit from Infrastructure-as-Code.
Six tips on how Site Reliability Engineers (SREs) can prepare for the reliability challenges of Black Friday and Cyber Monday 2021
Follow these steps to write a great SRE job resume.