This skill provides a complete framework for instrumenting service meshes with deep visibility into microservice interactions. It offers standardized templates for deploying observability stacks including Prometheus, Grafana, Jaeger, and Kiali, while focusing on the 'Golden Signals' of latency, traffic, errors, and saturation. Whether you are debugging complex network bottlenecks, defining Service Level Objectives (SLOs), or visualizing service dependencies, this skill streamlines the configuration of telemetry and alerting within Kubernetes environments to ensure robust production reliability.
Key Features
01PromQL snippets for calculating P99 latency and error rates
02Automated Kiali and Grafana dashboard configurations
03Distributed tracing setup using Jaeger and OpenTelemetry
0423,194 GitHub stars
05Pre-configured Istio and Linkerd monitoring templates
06Standardized alerting rules for mesh health and certificate expiry