Agent Performance Evaluation Framework | Claude Code Skill