In this workshop, you will learn the basics of monitoring and observability with microservices and container clusters. The hands-on session covers Prometheus and Grafana, kube-prometheus, alerts/incidents, and customizations/scaling. We will discuss best practices for logs and traces, and how innovation with OpenTelemetry and eBPF can help with advanced observability workflows.
Speaker: Michael Friedrich (@dnsmichi)
►►► Prerequisites ►►►
1. Install and configure a minikube cluster (https://minikube.sigs.k8s.io/docs/start/)
2. Make sure to go through the previous workshops:
Linux & Docker Fundamentals: • Linux & Docker Fundamentals complet...
Kubernetes 101: • Kubernetes 101 workshop complete ha...
►►► Resources ►►►
► Workshop Slide Deck https://docs.google.com/presentation/...
Chapters
00:00 Intro music
01:38 Introduction with Saiyam and Michael
08:10 Workshop Start
08:58 What to expect
10:05 Workshop requirements
12:22 Tips
15:06 Monitoring, quo vadis: in a nutshell, blackbox, metrics/trends, microservices, whitebox
23:00 Kubernetes: learn what to monitor
29:45 Metrics with Prometheus: Architecture, PromQL, UI
39:15 Prometheus Operator: install Prometheus in Kubernetes
52:22 Kubernetes Metrics
54:45 Prometheus Metrics in Grafana: Dashboards inspection, deployed by Prometheus Operator
01:02:56 Workshop dashboards in Grafana: first panel and Kubernetes dashboard, container metrics, kube-state-metrics
01:21:34 ServiceMonitor CRD for /metrics endpoints auto-discovery with Prometheus
01:37:56 Monitoring 2.0: Prometheus client libraries & instrumentation
01:45:43 Alerts and SLOs: Prometheus Operator CRDs, Alertmanager, podtatohead
01:55:53 Trigger alerts for podtatohead deployment: blackbox probe, dashboards, alert rules
02:14:45 Service Level Objectives & Ops Confidence: Golden Signals, SLOs as code
02:19:15 Customize kube-prometheus: custom dashboards, reduce visible data, see what is important
02:22:50 Beyond Metrics: Logs, Tracing, OpenTelemetry
02:31:28 OpenTelemetry demo deployment: shop, website, Jaeger tracing
02:39:05 Discussion: Why traces? What's next
02:45:35 Performance and Scaling: data retention, long-term storage, distributed scaling, GitLab.com SaaS production insights
02:53:50 Observability and Chaos Engineering
02:56:06 Chaos Mesh
02:58:10 DNS Chaos: demo exercise from KubeCon EU, leak app memory, verify alerts and SLOs
03:12:45 Take action from chaos experiments
03:15:00 Future Observability: eBPF, auto-instrumentation with Cilium Tetragon, etc.
03:17:30 SLOs and quality gates with Keptn
03:17:48 Security Policies with Kyverno, Hardening (book recommendation: Hacking Kubernetes)
03:18:54 Your adventure
03:19:20 Q&A https://o11y.love and outro with Saiyam and Michael
►►► Connect with Kubesimplify ►►►
► Twitter / kubesimplify
► Discord https://kubesimplify.com/discord
► Website https://kubesimplify.com/
► GitHub https://github.com/kubesimplify
#kubesimplify #kubernetes #observability