About
This skill provides a robust framework for handling data overlap in multi-source environments, such as news aggregators, product catalogs, or event feeds. It goes beyond simple URL matching by implementing semantic similarity grouping, source reputation scoring, and canonical version selection. By leveraging hash-based grouping and customizable preference logic, it ensures your application always presents the most authoritative and complete version of a record while providing detailed metrics on data reduction and optimization.