Implicit SLOs and their dangers

This is a topic of intermediate complexity in SLOs. If you are coming to this cold, we recommend reading a few other pieces about SLOs first; this one will then make a fair bit more sense. SLOs, as you may know, have a dual nature: they have both…

Detecting Disturbance: incidents and Benford's Law

Recently we at Stanza have been exploring operational data, and it's been really exciting to bring techniques and ideas from other domains into our own: production systems generally, traffic, alerting, cloud costs, etc. What we’ve been looking at most recently is something called Benford’…

The TwinSLO Proposal

Comments/Insights/Contributions from: Niall Murphy, Toby Burress, Štěpán Davidovič, Sal Furino. (Note that when I say "we" below, I don't specifically intend to speak for these fine people, I'm just using the academic "we". -Niall) Introduction: If you don’t already…

SRE in the Real World

(This is a repost of a document living here, but I am putting it here for backup's sake. Originally a joint effort with Murali Suriar, with input from Matt Brown, Liz Fong-Jones, and many others. The intended audience of this doc is the recently laid-off, or those who…

What SRE could be

Today, I believe we cannot successfully answer several key questions about SRE. Let's start with the most important one: how can we understand what reliability customers want and need?…