DPO Preference Optimization - Claude Code Skill