Join us

heart Posts from the community tagged with slo...
Sponsored Link FAUN Team
@faun shared a link, 1 year, 1 month ago

Read AI/M Weekly

AI Weekly Newsletter, Kala. Curated AI news, tutorials, tools and more - Join thousands of other readers, 100% free, unsubscribe anytime.

Story
@yair_stark shared a post, 2 years, 2 months ago

Error Budget Is All You Need - Part 2

In part 1 I proposed a simple modification to Google’s Multi-Window Multi-Burn Rate alerting setup and I showed how this modification addresses the cases of varying-traffic services and typical latency SLOs.

1_gm3BXHRG_TVt9Hc5cQbOJA (1).png
Story
@yair_stark shared a post, 2 years, 2 months ago

Error Budget Is All You Need - Part 1

One of the great chapters of Google’s Site Reliability Engineering (SRE) second book is chapter 5 — Alerting on SLOs (Service Level Objectives). This chapter takes you on a comprehensive journey through several setups of alerts on SLOs, starting with the simplest non-optimized one and by iterating through several setups reach the ultimate one, which is optimized w.r.t to the main four alerting attributes: recall, precision, detection time and reset time.

1_gm3BXHRG_TVt9Hc5cQbOJA.png
Story
@squadcast shared a post, 2 years, 4 months ago

What can SREs do to make holiday season’s peak traffic less chaotic?

Holiday season's peak traffic is the most challenging period for SREs and on-call engineers. In this blog, we have highlighted the things that SREs can do to make the holiday season less chaotic.

HowCanSRE_BlackFriday-570x330.png