Join us

heart Posts from the community tagged with slo...
Sponsored Link FAUN Team
@faun shared a link, 1 year ago

Read CloudNative Weekly Newsletter

CloudNative Weekly Newsletter, The Chief I/O. Curated CloudNative news, tutorials, tools and more!

Join thousands of other readers, 100% free, unsubscribe anytime.

Story
@yair_stark shared a post, 2 years, 1 month ago

Error Budget Is All You Need - Part 2

In part 1 I proposed a simple modification to Google’s Multi-Window Multi-Burn Rate alerting setup and I showed how this modification addresses the cases of varying-traffic services and typical latency SLOs.

1_gm3BXHRG_TVt9Hc5cQbOJA (1).png
Story
@yair_stark shared a post, 2 years, 1 month ago

Error Budget Is All You Need - Part 1

One of the great chapters of Google’s Site Reliability Engineering (SRE) second book is chapter 5 — Alerting on SLOs (Service Level Objectives). This chapter takes you on a comprehensive journey through several setups of alerts on SLOs, starting with the simplest non-optimized one and by iterating through several setups reach the ultimate one, which is optimized w.r.t to the main four alerting attributes: recall, precision, detection time and reset time.

1_gm3BXHRG_TVt9Hc5cQbOJA.png
Story
@squadcast shared a post, 2 years, 3 months ago

What can SREs do to make holiday season’s peak traffic less chaotic?

Holiday season's peak traffic is the most challenging period for SREs and on-call engineers. In this blog, we have highlighted the things that SREs can do to make the holiday season less chaotic.

HowCanSRE_BlackFriday-570x330.png