Discover Agent Skills for analytics & monitoring. Browse 47 skills for Claude, ChatGPT & Codex.
Analyzes LLM performance metrics and score trends directly within Claude Code using Langfuse observability data.
Fetches and analyzes Langfuse traces with customizable output modes to debug LLM workflows and monitor application performance.
Analyzes tracked behaviors and outcomes to generate actionable insights and data-driven feedback loops.
Manages Langfuse datasets for AI regression testing and golden set curation directly through the Claude Code CLI.
Analyzes multi-turn conversation flows and session-level metrics within Langfuse to debug user journeys and track LLM performance.
Manages human annotations and manual scoring workflows for Langfuse LLM traces directly from Claude.
Orchestrates end-to-end evaluation cycles for AI agents using Langfuse to identify performance regressions and generate actionable optimization reports.
Provides deep insights into multi-turn LLM conversations by analyzing and debugging Langfuse trace sessions.
Manages the complete lifecycle of Langfuse LLM prompts, including version control, deployment labels, and side-by-side version comparisons.
Provides strategic guidance for evaluating and optimizing AI agents using Langfuse traces and data-driven iteration loops.
Analyzes and visualizes LLM quality scores, trends, and regressions within the Langfuse observability platform.
Diagnoses and resolves LLM workflow issues by performing structured root-cause analysis on Langfuse traces.
Instruments Python LLM pipelines with Langfuse tracing, observability patterns, and performance scoring.
Extracts and filters Langfuse observability traces to provide surgical debugging insights directly within the Claude Code environment.
Instruments Python applications with Langfuse tracing to provide deep observability into LLM calls, pipelines, and agentic workflows.
Manages human annotations and quality scores for Langfuse traces directly within the Claude Code environment.
Diagnoses AI workflow issues by correlating Langfuse trace data with codebase logic to identify root causes and generate actionable fixes.
Identifies code performance bottlenecks, evaluates resource efficiency, and provides actionable optimization strategies.
Retrieves real-time account usage information and quota statistics for the GLM Coding Plan.
Submits detailed conversation context and user feedback to help debug and improve AI-driven development workflows.
Generates professional PDF and PowerPoint location analytics reports from PinMeTo data for board-ready business reviews.
Delivers professional performance audits, bottleneck identification, and scalability assessments without modifying source code.
Conducts expert-level observability audits, logging reviews, and monitoring strategy assessments to improve system visibility.
Configures Prometheus for end-to-end metric collection, alerting, and infrastructure monitoring across diverse environments.
Provides a comprehensive knowledge base and data models for ecommerce analytics, SEM performance, and cross-channel attribution scoring.
Monitors and visualizes the real-time status of Claude Code agents across multiple WezTerm terminal panes.
Implements measurable reliability targets using SLIs, SLOs, and error budgets to balance service stability with development velocity.
Implements comprehensive monitoring, distributed tracing, and visualization for Istio and Linkerd service mesh environments.
Analyzes and optimizes web application performance across bundles, React rendering, databases, and network requests.
Builds and manages production-grade Grafana dashboards for real-time system observability and metric visualization.