Discover Agent Skills for analytics & monitoring. Browse 47 skills for Claude, ChatGPT & Codex.
Generates a visual 2025 year-in-review report for any repository by analyzing commits, pull requests, and issue activity.
Performs comprehensive audits and automated repairs for Claude Code skills, agents, hooks, and memory systems.
Measures software quality through actionable DORA metrics and automated quality gates to drive continuous improvement.
Implements industry-standard logging, metrics, and distributed tracing patterns to ensure system reliability and monitoring.
Conducts structured root cause analysis using an evidence-based methodology to isolate true system failures from contributing factors.
Enforces statistical rigor and mandatory validation gates for designing valid, high-confidence A/B tests before implementation begins.
Enables Claude to investigate production issues by searching logs, querying metrics, and analyzing distributed traces via the Datadog platform.
Diagnoses and permanently resolves AI hallucinations, instruction drift, and component failures through forensic log analysis.
Manages IT infrastructure, service reliability, and incident response through automation and comprehensive observability frameworks.
Implements comprehensive monitoring and distributed tracing for service mesh architectures like Istio and Linkerd.
Configures production-grade monitoring, health probes, and Micrometer metrics for Spring Boot services.
Analyzes file systems to provide detailed metadata, line counts, and content statistics without modifying source code.
Implements distributed tracing with Jaeger and Tempo to monitor request flows and optimize performance across microservices.
Implements measurable reliability targets using SLIs, SLOs, and error budgets to balance system stability with development velocity.
Builds and manages production-grade Grafana dashboards for real-time observability of infrastructure and applications.
Monitors and displays real-time progress, health metrics, and resource consumption for autonomous Ralph Ultra development sessions.
Monitors autonomous development progress, token costs, and story status in a real-time terminal dashboard.
Standardizes the A/B testing workflow through rigorous hypothesis validation, metric definitions, and statistical power checks.
Instruments and monitors LLM applications with advanced tracing, prompt management, and evaluation metrics.
Implements comprehensive tracing, monitoring, and prompt management for LLM applications using the open-source Langfuse platform.
Standardizes the creation and execution of rigorous A/B tests through mandatory validation gates and statistical planning.
Implements comprehensive LLM observability, tracing, and prompt management using the open-source Langfuse platform.
Implements comprehensive observability, tracing, and prompt management for LLM applications using the Langfuse platform.
Measures, analyzes, and optimizes web application performance using Core Web Vitals and advanced profiling techniques.
Optimizes application performance through systematic measurement, analysis, and targeted optimization techniques.
Optimizes application speed and stability through structured measurement, bottleneck analysis, and Core Web Vitals refinement.
Measures, analyzes, and optimizes web application performance using Core Web Vitals and advanced profiling workflows.
Automates the deployment and configuration of centralized logging systems like ELK, Loki, and Splunk for production environments.
Analyzes application logs to identify performance bottlenecks, recurring error patterns, and system anomalies for improved stability.
Analyzes and optimizes system throughput to identify performance bottlenecks and improve resource capacity.
Scroll for more results...