About
This skill is designed for data engineers and AI developers who maintain 'golden' ground-truth datasets used for RAG evaluation or model benchmarking. It provides a robust framework for validating document and query schemas, detecting near-duplicate content through similarity thresholds, and ensuring comprehensive coverage across domains and difficulty levels. By automating these integrity checks, the skill helps maintain high-fidelity datasets that produce reliable, reproducible performance metrics for AI systems.