Discover Agent Skills for security & testing. Browse 105 skills for Claude, ChatGPT & Codex.
Provides actionable techniques and command-line references for escalating user privileges on Linux and Windows systems.
Guides users through the complete penetration testing lifecycle from initial reconnaissance to professional security reporting.
Enforces a rigorous evidence-based protocol that requires fresh command output before any task is claimed as finished or fixed.
Evaluates and benchmarks LLM agents using behavioral testing, reliability metrics, and production monitoring to ensure consistent performance in real-world scenarios.
Conducts comprehensive security assessments and penetration tests across AWS, Azure, and Google Cloud Platform environments.
End of results