SRE: Alerting on SLOs

Terminology Refresher

  • Availability
  • Latency
sum(rate(http_requests_total{status!~"5.."})) 
/
sum(rate(http_requests_total{}))
sum(rate(http_request_duration_seconds_bucket{le="0.1"}))
/
sum(rate(http_request_duration_seconds_count))

Client Experience

Objective Quantities

Call to Action

Generic Tooling

Conclusion

Resources:

--

--

--

https://stackoverflow.com/users/594589/dm03514

Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Android Design Patterns

YTNObserver #3 Yenten (YTN) Raspberry Pi home miner

Observability and Instrumentation

How I passed AWS Certified Solutions Architect Exam in One Month without any prior experience

HTTP Response in Golang

2.5D Platformer — #2 Horizontal Movement

How to Send a WhatsApp API using Java

WhatsApp API using Java

Processors: CPU, GPU, FPGA, Accelerator

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
dm03514

dm03514

https://stackoverflow.com/users/594589/dm03514

More from Medium

Runtime Control: Why I Joined Glasnostic

Headshot with computer code by @markusspiske

Bite-Sized AWS (Part One)

Thoughts about managing Kafka clusters: How to deal with disaster

Shifting from Traditional Infra to Cloud Native Role