With observability becoming more understood and common, I felt it would be useful to compile a list of learning material that people new to this area might appreciate. The better we observe and consequently understand our systems, the better we get at predicting and reducing risks.
A huge thanks to Abby Bangser who has shared most of this material in the women in test #infra_and_devops channel.
If you have content on observability that is useful for people wanting to learn more, please add to comments below.
The doyenne’s of observability. These folks are the ones that have tirelessly driven and promoted observability before it became a ‘thing’. There’s a mountain of great content on the honeycomb website. Start with https://www.honeycomb.io/resources/white-papers/ but also they have a podcast and videos.
Observability by Doing
if you are the type of person who learns by doing then this observability playground by Abby Bangser is perfect for you. A simple webapp but integrated with loads of monitoring and observability type tools to learn the basics.
You can download here
And if you want to use Azure instead of AWS go to Parveen Khans write up on pairing with Abby Bangser
I’m holding a Friday mentoring group for testers new to observability starting 5th June 2020. You can sign up here.
This is an excellent write up on Observability by Andy Dote
And this video Unifying your Observability Pipeline by Aditya Mukerjee
How to Build Observable Distributed Systems by Pierre Vincent
the Venn diagram of observability by David Worth
The Missing O11y Primer by Daniel Dyla
I like this analogy by Katy Farmer: What is Observability?
Learn about Structured Logging by Aditya Praharaj
Metrics (and their challenges)
Example of the challenges with metrics (aggregates)
SLO (Service Level Objectives)
Learn about SLO’s and SLI’s in this pdf created by Julie McCoy with Nicole Forsgren
Links to further content on resilience by Lorin Hochstein