This skill automates the process of data discovery by performing a deep-dive analysis of SQL database tables. It extracts schema metadata, calculates type-specific column statistics, analyzes cardinality, and performs a multi-dimensional data quality assessment covering completeness, uniqueness, freshness, validity, and consistency. It is an essential tool for data engineers, analysts, and developers who need to quickly grasp the nuances of an unfamiliar dataset, identify potential data issues, or generate documentation without writing dozens of manual exploratory queries.
Key Features
01Type-specific column statistics for numeric, string, and date types
02Automated schema and metadata extraction from INFORMATION_SCHEMA
03Multi-dimensional data quality scoring and issue identification
0477 GitHub stars
05Representative data sampling and structured summary reporting
06Cardinality and frequency analysis for categorical data distribution