How to Improve Upon Google’s Four Golden Signals of Monitoring
The Four Golden Signals of monitoring and observability get a lot of things right. But they could be even better.
November 4, 2024
4 mins
KubeCon doesn’t have an SRE track, so we’ve gone through the 300+ talks so you don’t have to. We picked the ones that we find more inspiring for reliability folks.
KubeCon North America is just around the corner, featuring over three hundred talks across four days. Deciding which sessions to attend in Salt Lake City requires careful planning to make the most of your time. To help with that, we’ve curated a list of talks particularly relevant for SREs. From case studies of reliability at scale to the relationship between AI and SRE, we hope you find interesting talks to add to your KubeCon schedule.
Rootly will also have a big presence at KubeCon. With a talk during Platform Engineering Day, a booth in the Solutions Showroom, a co-hosted lunch with Spotify on Wednesday, and a happy hour on Thursday. You’ll find all the details at the end of this article.
Unfortunately, both case studies on scaling a company’s reliability are scheduled at the same time, so you’ll need to choose which scale is more interesting to you. Will you explore how a titan like Global Payments ensures system reliability, or learn how a rapidly scaling fintech like Cash App develops new strategies to prevent outages?
How do you ensure over 32 billion card transactions go through securely every time? Trey Caliva, Principal Cloud Architect at Global Payments, will introduce us to the architecture behind their multi-region setup on GCP with Kubernetes and CockroachDB.
When: Wednesday, November 13, 2024, 3:25 pm - 4:00 pm MST
Where: Salt Palace | Level 1 | 155 B
Add Global Payments' talk to your schedule
Rachel Sheikh, Software Engineer at Cash App, will showcase how the company scaled up its reliability strategy. The Cash App team introduced a new paradigm for their Kubernetes clusters that allows services to transition in and out while providing guardrails against common outages.
When: Wednesday, November 13, 2024, 3:25 pm - 4:00 pm MST
Where: Salt Palace | Level 1 | Grand Ballroom H
Add Cash App’s talk to your schedule
The evolution of AI is reshaping how SRE teams operate, potentially unlocking new possibilities in the tracing arena. However, AI is not a panacea but yet another system that SREs must manage.
Traditional tracing solutions often prioritize common traces, making rare traces invisible. Yet, these rare traces are precisely what can be crucial for diagnosing API failures. Mitul Tandon and Akash Gusain will discuss an AI-based tracing solution that treats all traces equally, leading to improved MTTR through more effective diagnoses.
When: Thursday, November 14, 2024, 11:55 am - 12:30 pm MST
Where: Salt Palace | Level 1 | Grand Ballroom B
Add this AI-based tracing talk to your schedule
LLM deployments are vast and complex, presenting new challenges for SREs. How do you identify which part of the system is draining resources or causing performance issues? Seema Saharan, SRE at Autodesk, and Aditya Soni, DevOps Engineer at Forrester, will dive into what it means to improve the efficiency of an LLM deployed with Kubernetes.
When: Tuesday, November 12, 2024, 12:55 pm - 1:20 pm MST
Where: Salt Palace | Level 2 | 255 B
Add this talk on making LLMs more reliable to your schedule
Many KubeCon talks inspire you to challenge assumptions and spark creative ideas. But once you’re back at work, it’s also valuable to have actionable knowledge you can apply immediately.
OpenTelemetry, also known as OTel, has become the standard observability framework in recent years. In this tutorial, you’ll learn how to instrument Python and Java applications with OpenTelemetry.
When: Friday, November 15, 2024, 11:00 am - 12:30 pm MST
Where: Salt Palace | Level 1 | Grand Ballroom G
Add OTel tutorial to your schedule
Upgrading Kubernetes is a recurring pain point for DevOps and SREs. In this talk, Jago Macleod, Engineering Director at Google, will discuss strategies to simplify the process and achieve more reliable rollouts.
When: Friday, November 15, 2024, 11:55 am - 12:30 pm MST
Where: Salt Palace | Level 1 | Grand Ballroom H
Add this talk on Kubernetes upgrades to your schedule
Meet with our reliability experts in the Expo Showroom. Our booth is located in the Startups Pavilion—look for Booth Q47 on the venue map.
Our very own Jorge Lainfiesta will be giving a talk with Abby Bangser, CNCF Platforms WG Chair, on how to make platforms and portals easier to maintain, scale, and use in the long run. Add the talk to your schedule.
We’re partnering with Spotify to enhance your lunch experience at KubeCon. Join us for an exclusive engineering leadership lunch on Wednesday, November 13. RSVP now—spots are limited.
Alongside Snowflake, Panther, and Infiscal, we’re hosting a KubeCon Engineering Leaders Happy Hour on Thursday, November 14. RSVP now—spots are limited.