Eval Gap Finder - Claude Code Skill for AI Benchmarking