The Unofficial SRE Track for KubeCon EU '25The Unofficial SRE Track for KubeCon EU '25

The Unofficial SRE Track for KubeCon EU '25

KubeCon doesn’t have an SRE track but we’ve gone through the 300+ sessions that’ll take place in London so you don’t have to.

Jorge Lainfiesta

Jorge Lainfiesta

March 19, 2025
10 mins
The Ultimate DORA Compliance Checklist for 2025The Ultimate DORA Compliance Checklist for 2025

The Ultimate DORA Compliance Checklist for 2025

This guide covers everything you need to know about DORA compliance, including deadlines, penalties, and a step-by-step checklist to meet the new EU regulation.

Jorge Lainfiesta

Jorge Lainfiesta

February 27, 2025
11 mins
Google SREs are changing the game again: a breakdown of their new approachGoogle SREs are changing the game again: a breakdown of their new approach

Google SREs are changing the game again: a breakdown of their new approach

Google SREs are redefining reliability practices with STAMP, addressing the limitations of traditional models as systems scale. Their approach highlights the need for system-wide hazard analysis.

Jorge Lainfiesta

Jorge Lainfiesta

January 8, 2025
7 mins
SRE Tools That Actually Work: Cut MTTR by 70% or MoreSRE Tools That Actually Work: Cut MTTR by 70% or More

SRE Tools That Actually Work: Cut MTTR by 70% or More

The right SRE tools can improve user trust and free engineers to focus on building rather than firefighting.

Jorge Lainfiesta

Jorge Lainfiesta

January 5, 2025
5 mins
The Best PagerDuty AlternativesThe Best PagerDuty Alternatives

The Best PagerDuty Alternatives

PagerDuty has long been a dominant player in the incident management space. As organizations grow, their incident response needs become more complex. Many teams then seek solutions that fit their specific requirements better.

Jorge Lainfiesta

Jorge Lainfiesta

January 4, 2025
6 mins
The Rapid Recovery Blueprint: Optimize Incident Response NowThe Rapid Recovery Blueprint: Optimize Incident Response Now

The Rapid Recovery Blueprint: Optimize Incident Response Now

This blueprint provides a comprehensive framework for optimizing your incident response process, reducing MTTR, and building resilience into your systems.

Jorge Lainfiesta

Jorge Lainfiesta

January 4, 2025
7 mins
10 SRE Tools the Most Reliable Engineering Teams Actually Use10 SRE Tools the Most Reliable Engineering Teams Actually Use

10 SRE Tools the Most Reliable Engineering Teams Actually Use

This article breaks down the 10 SRE tools that high-performing teams rely on to detect, respond to, and resolve incidents quickly. Whether you’re building your SRE toolkit or looking to improve your incident management process, these tools form the backbone of modern reliability engineering.

Jorge Lainfiesta

Jorge Lainfiesta

January 3, 2025
8 mins
Faster Incident Resolution Playbook: From Alert to Fix in MinutesFaster Incident Resolution Playbook: From Alert to Fix in Minutes

Faster Incident Resolution Playbook: From Alert to Fix in Minutes

Incident management software is the backbone of any high-performing response process. The right platform centralizes alerts, automates workflows, and keeps everyone on the same page from the first signal to the final fix.

Jorge Lainfiesta

Jorge Lainfiesta

January 3, 2025
5 mins
The Essential SRE Tooling Guide for Modern Engineering TeamsThe Essential SRE Tooling Guide for Modern Engineering Teams

The Essential SRE Tooling Guide for Modern Engineering Teams

We explore the essential SRE tooling landscape and how platforms are transforming incident management for modern engineering teams.

Jorge Lainfiesta

Jorge Lainfiesta

January 2, 2025
7 mins
Beyond Faster Alerts: How Top Teams Actually Resolve IncidentsBeyond Faster Alerts: How Top Teams Actually Resolve Incidents

Beyond Faster Alerts: How Top Teams Actually Resolve Incidents

While most teams have invested in faster alerts, the real challenge is what happens next: how quickly and effectively teams coordinate, communicate, and resolve incidents.

Jorge Lainfiesta

Jorge Lainfiesta

January 2, 2025
6 mins
MTTR Mastery: Build an Incident Response System That Actually WorksMTTR Mastery: Build an Incident Response System That Actually Works

MTTR Mastery: Build an Incident Response System That Actually Works

Building an incident response system that actually works requires more than just faster alerts. It demands a holistic approach that combines automation, collaboration, and actionable post-incident insights.

Jorge Lainfiesta

Jorge Lainfiesta

January 2, 2025
6 mins
5 Proven Tactics to Slash Incident Response Time by 50%5 Proven Tactics to Slash Incident Response Time by 50%

5 Proven Tactics to Slash Incident Response Time by 50%

Reducing incident response time can significantly impact business continuity and customer satisfaction. Here are five proven tactics that leverage insights from industry leaders.

Jorge Lainfiesta

Jorge Lainfiesta

January 2, 2025
4 mins