01 Automated rubric generation for objective assessment
02 Comprehensive bias mitigation for length and self-enhancement
03 Chain-of-thought justification for improved scoring reliability
04 Direct scoring with calibrated 1-5 scales
05 Pairwise comparison with position-swap consistency checks
06 10 GitHub stars
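
The position-swap consistency check in item 05 can be illustrated with a minimal sketch. It assumes a hypothetical judge callable that receives a prompt and two responses and returns "first" or "second". The names `pairwise_with_swap`, `Judge`, `longer_wins`, and `always_first` are illustrative and not taken from the project. The two toy judges stand in for a real model call.

```python
from typing import Callable

# Hypothetical judge signature (an assumption, not the project's API):
# takes (prompt, first_response, second_response), returns "first" or "second".
Judge = Callable[[str, str, str], str]


def pairwise_with_swap(judge: Judge, prompt: str, a: str, b: str) -> str:
    """Compare responses A and B in both orders.

    Returns "A" or "B" when both orderings agree, otherwise "inconsistent".
    """
    first_pass = judge(prompt, a, b)   # A is shown in the first position
    second_pass = judge(prompt, b, a)  # B is shown in the first position

    for verdict in (first_pass, second_pass):
        if verdict not in ("first", "second"):
            raise ValueError(f"unexpected judge verdict: {verdict!r}")

    # Map positional verdicts back to the underlying responses.
    winner_1 = "A" if first_pass == "first" else "B"
    winner_2 = "B" if second_pass == "first" else "A"
    return winner_1 if winner_1 == winner_2 else "inconsistent"


def longer_wins(prompt: str, first: str, second: str) -> str:
    """Toy judge that depends only on content, so it is order-independent."""
    return "first" if len(first) >= len(second) else "second"


def always_first(prompt: str, first: str, second: str) -> str:
    """Toy judge with pure position bias: always prefers the first slot."""
    return "first"


if __name__ == "__main__":
    print(pairwise_with_swap(longer_wins, "q", "short", "a much longer answer"))
    print(pairwise_with_swap(always_first, "q", "x", "y"))
```

A verdict that flips when the order is swapped is reported as inconsistent instead of being counted as a win. A caller could then treat that pair as a tie or send it for re-evaluation.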