About
This skill provides a framework for instrumenting RAG (Retrieval-Augmented Generation) pipelines and agent memory systems in your codebase. It lets developers track which sources were retrieved, evaluate their relevance through granular quality signals, and monitor context window utilization to prevent truncation errors. Standardized spans for query processing, vector search, and reranking make each stage of the otherwise opaque RAG process visible and debuggable, which helps with optimizing LLM responses and tracking cost precisely.
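To make the idea of per-stage spans concrete, here is a minimal sketch in plain Python. It is not this skill's API: `Tracer`, `Span`, `run_pipeline`, `count_tokens`, and every attribute name (`rag.vector_search`, `context.utilization`, and so on) are hypothetical names chosen for illustration, the "vector search" is a toy word-overlap score, and token counting is a whitespace approximation.

```python
import time
from contextlib import contextmanager
from dataclasses import dataclass, field
from typing import Any, Dict, Iterator, List


@dataclass
class Span:
    """One recorded pipeline stage (hypothetical structure)."""
    name: str
    attributes: Dict[str, Any] = field(default_factory=dict)
    duration_ms: float = 0.0


class Tracer:
    """Collects spans in memory; a stand-in for a real tracing backend."""

    def __init__(self) -> None:
        self.spans: List[Span] = []

    @contextmanager
    def span(self, name: str) -> Iterator[Span]:
        current = Span(name)
        start = time.perf_counter()
        try:
            yield current
        finally:
            # Record the span even if the stage raised.
            current.duration_ms = (time.perf_counter() - start) * 1000
            self.spans.append(current)


def count_tokens(text: str) -> int:
    # Assumption: whitespace split approximates the model tokenizer.
    return len(text.split())


def run_pipeline(tracer: Tracer, query: str, corpus: Dict[str, str],
                 context_limit: int) -> List[str]:
    """Run a toy retrieve/rerank/pack pipeline, one span per stage."""
    with tracer.span("rag.query") as s:
        normalized = query.strip().lower()
        s.attributes["query.text"] = normalized

    with tracer.span("rag.vector_search") as s:
        # Toy relevance score: number of words shared with the query.
        terms = set(normalized.split())
        scored = [(doc_id, len(terms & set(text.lower().split())))
                  for doc_id, text in corpus.items()]
        hits = [(doc_id, score) for doc_id, score in scored if score > 0]
        s.attributes["retrieved.ids"] = [doc_id for doc_id, _ in hits]
        s.attributes["retrieved.scores"] = [score for _, score in hits]

    with tracer.span("rag.rerank") as s:
        ranked = sorted(hits, key=lambda hit: hit[1], reverse=True)
        s.attributes["reranked.ids"] = [doc_id for doc_id, _ in ranked]

    with tracer.span("rag.context") as s:
        # Pack documents in rank order until the token budget is exhausted.
        used, kept = 0, []
        for doc_id, _ in ranked:
            cost = count_tokens(corpus[doc_id])
            if used + cost > context_limit:
                break
            kept.append(doc_id)
            used += cost
        s.attributes["context.tokens_used"] = used
        s.attributes["context.limit"] = context_limit
        s.attributes["context.utilization"] = used / context_limit
        s.attributes["context.dropped_ids"] = [
            doc_id for doc_id, _ in ranked if doc_id not in kept
        ]
    return kept
```

Recording `context.dropped_ids` alongside utilization is the point of the last span: when a relevant document is cut for lack of room, the trace shows which one, instead of the answer quietly degrading. In a real deployment the in-memory `Tracer` would likely be replaced by an actual tracing backend.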