Skip links

Open Observability Day 2022


Confidence with Chaos for Your Kubernetes Observability

Michael Friedrich, GitLab


The Kubernetes observability stack is deployed with Prometheus, and dashboards provide many interesting insights. What’s next? Everything is overwhelming and your teams are drowning in alerts. Alerts and Service Level Objectives (SLO) require team discussions and adjustments, and the dashboards could benefit from more fine granular details. Documentation and action items for your SRE and DevOps teams are needed too. Simulate a production incident to see whether SLOs are met, or alerts are fired. Is there a way to observe the deployed application, and see if it breaks from chaos? Join this talk to dive into ops and dev stories with practical insights into Kubernetes metrics, Prometheus alerting, chaos engineering with Chaos Mesh, OpenTelemetry app instrumentation, and hear about production incidents with failed SLOs. Gain confidence with chaos as an SRE, and as a developer seeing the value in Observability. Welcome to day 2 DevOps.

Presentation Deck



Michael Friedrich
Senior Developer Evangelist, GitLab

Michael Friedrich is a Senior Developer Evangelist at GitLab focussing on Observability, SRE, and Ops. He loves to help educate everyone and regularly speaks at events and meetups. Michael co-founded the #EveryoneCanContribute cafe meetup group to learn cloud-native & DevOps. Michael is a Polynaut advisor at Polywork, created as a learning platform for Observability, and shares insights in the newsletter.