About
The Chaos Engineering Toolkit empowers developers and SREs to proactively test system robustness by simulating real-world failures such as network latency, resource exhaustion, and service outages. By guiding users through experiment design, tool selection, and result analysis, the skill helps teams validate recovery strategies like circuit breakers and retry logic before production issues occur. It acts as a specialist companion for conducting GameDays and ensuring high availability across Kubernetes, AWS, and complex distributed architectures.