Learning Observability

pillars of observability Ronaldo Geldres

With observability becoming more understood and common, I felt it would be useful to compile a list of learning material that people new to this area might appreciate. The better we observe and consequently understand our systems, the better we get at predicting and reducing risks.

A huge thanks to Abby Bangser who has shared most of this material in the women in test #infra_and_devops channel.

If you have content on observability that is useful for people wanting to learn more, please add to comments below. 


The doyenne’s of observability. These folks are the ones that have tirelessly driven and promoted observability before it became a ‘thing’. There’s a mountain of great content on the honeycomb website. Start with https://www.honeycomb.io/resources/white-papers/ but also they have a podcast and videos.

Observability by Doing

if you are the type of person who learns by doing then this observability playground by Abby Bangser is perfect for you. A simple webapp but integrated with loads of monitoring and observability type tools to learn the basics. 

You can download here  

And if you want to use Azure instead of AWS go to Parveen Khans write up on pairing with Abby Bangser

I’m holding a Friday mentoring group for testers new to observability starting 5th June 2020. You can sign up here


First up, looking at the 3 pillars (see feature image) as the ‘fuel’ not the car by Ben Stigelman 

This is an excellent write up on Observability by Andy Dote 

And this video Unifying your Observability Pipeline by Aditya Mukerjee 

How to Build Observable Distributed Systems by Pierre Vincent 

the Venn diagram of observability by David Worth

The Missing O11y Primer by Daniel Dyla 

I like this analogy by Katy Farmer: What is Observability? 

Related Content 

Learn about Structured Logging by Aditya Praharaj 

Metrics (and their challenges)

Example of the challenges with metrics (aggregates)

SLO (Service Level Objectives)

Learn about SLO’s and SLI’s in this pdf created by Julie McCoy with Nicole Forsgren

SLO Adoption and Usage in SRE


Prometheus Explained  by S Santhosh Nagaraj


Links to further content on resilience by Lorin Hochstein

Learning Observability

Leave a Reply

Your email address will not be published. Required fields are marked *

Scroll to top