Open Observability Day 2022
Confidence with Chaos for Your Kubernetes Observability
Michael Friedrich, GitLab
The Kubernetes observability stack is deployed with Prometheus, and dashboards provide many interesting insights. What’s next? Everything is overwhelming and your teams are drowning in alerts. Alerts and Service Level Objectives (SLOs) require team discussions and adjustments, and the dashboards could benefit from more fine-grained details. Documentation and action items for your SRE and DevOps teams are needed too. Simulate a production incident to see whether SLOs are met and whether alerts fire. Is there a way to observe the deployed application and see if it breaks under chaos? Join this talk to dive into ops and dev stories with practical insights into Kubernetes metrics, Prometheus alerting, chaos engineering with Chaos Mesh, and OpenTelemetry app instrumentation, and hear about production incidents with failed SLOs. Gain confidence with chaos as an SRE, and see the value in Observability as a developer. Welcome to day 2 DevOps.
Senior Developer Evangelist, GitLab
Michael Friedrich is a Senior Developer Evangelist at GitLab focusing on Observability, SRE, and Ops. He loves to help educate everyone and regularly speaks at events and meetups. Michael co-founded the #EveryoneCanContribute cafe meetup group to learn about cloud-native and DevOps. He is a Polynaut advisor at Polywork, created o11y.love as a learning platform for Observability, and shares insights in the opsindev.news newsletter.