01 Adversarial Robustness Assessment (RobustBench, AdvGLUE)
02 Jailbreak & Prompt Injection Testing (JailbreakBench, AdvBench)
03 Standardized Safety Evaluation (HarmBench, ToxiGen, TruthfulQA)
04 Privacy & Data Extraction Audits (Membership Inference, Model Inversion)
05 Mapping to the OWASP Top 10 for LLM Applications (2025) and the NIST AI RMF