Discover Agent Skills for analytics & monitoring. Browse 47 skills for Claude, ChatGPT & Codex.
Analyzes and optimizes Claude Flow swarm performance by detecting bottlenecks and providing actionable AI-powered recommendations.
Configures Prometheus for comprehensive metric collection, alerting, and observability across infrastructure and applications.
Identifies and resolves performance bottlenecks in Python code using advanced profiling tools and high-efficiency implementation patterns.
Implements end-to-end request tracking across microservices using Jaeger and Tempo to identify performance bottlenecks and debug complex distributed systems.
Defines and implements measurable service level objectives and error budgets to optimize system reliability and engineering velocity.
Designs and manages production-ready Grafana dashboards for real-time observability and comprehensive system monitoring.
Optimizes Python application performance through comprehensive profiling, memory analysis, and implementation of high-efficiency code patterns.
Configures Prometheus for comprehensive metric collection, infrastructure monitoring, and automated alerting systems.
Creates and manages production-grade Grafana dashboards for real-time system observability and metric visualization.
Implements distributed tracing with Jaeger and Tempo to monitor microservice request flows and pinpoint performance bottlenecks.
Implements end-to-end request tracking across microservices using Jaeger and Tempo to identify performance bottlenecks and service failures.
Creates and manages production-ready Grafana dashboards for real-time visualization of system, infrastructure, and application metrics.
Profiles and optimizes Python code to eliminate execution bottlenecks and improve resource efficiency.
Implements distributed tracing with Jaeger and Tempo to monitor cross-service request flows and identify performance bottlenecks.
Defines and implements Service Level Indicators (SLIs) and Objectives (SLOs) to manage service reliability and error budgets effectively.
Configures comprehensive Prometheus monitoring for infrastructure and applications, including metric collection, alerting rules, and service discovery.
Analyzes git history to quantify unplanned work, identify interrupt hotspots, and measure the impact of technical debt on team velocity.
Analyzes Claude Code sessions and git history to generate actionable retrospective reports for continuous workflow improvement.
Creates and manages production-ready Grafana dashboards for real-time visualization of system, infrastructure, and application metrics.
Systematizes the debugging process using hypothesis-driven workflows and isolated instrumentation to resolve regressions, incidents, and flaky tests.
Manages and audits decision-making workflows within Claude Code using the Model Context Protocol.
Configures Prometheus for robust infrastructure and application metric collection, storage, and alerting.
Defines and implements Service Level Indicators (SLIs) and Service Level Objectives (SLOs) with error budgets and automated alerting.
Profiles and optimizes Python code to eliminate bottlenecks, reduce memory usage, and improve application latency.
Defines and implements service reliability targets using SLIs, SLOs, and error budgets to balance innovation with system stability.
Facilitates structured 5 Whys analysis to identify systemic root causes of software incidents through neutral guidance and real-time visualization.
Monitors skill performance and gathers anonymous user feedback through a privacy-first, opt-in telemetry framework.
Analyzes Claude Code conversation history to provide semantic search, pattern detection, and interactive activity dashboards.
Sets up comprehensive OpenTelemetry monitoring for tracking Claude Code usage, token costs, and developer productivity.
Retrieves comprehensive Amplitude user profiles and activity logs using Device or User IDs via the REST API.