Real-time classification of inbound messages into ALLOW, WARN, or BLOCK categories
Automated detection of prompt injection and safety bypass attempts
Tiered permission evaluation supporting READ_ONLY, WRITE_LOCAL, and FULL_ACCESS levels
Structured JSON output providing classification rationale and suggested access tiers
Context-aware screening that accounts for prior user warnings and history
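
To make the features above concrete, here is a minimal Python sketch of what a screening result could look like. Only the category names (ALLOW, WARN, BLOCK) and tier names (READ_ONLY, WRITE_LOCAL, FULL_ACCESS) come from the list; the JSON field names, the `screen` function, the `prior_warnings` parameter, and the keyword list are hypothetical stand-ins chosen for illustration. The real detector is presumably more sophisticated than substring matching.

```python
import json
from dataclasses import dataclass
from enum import Enum


class Verdict(str, Enum):
    ALLOW = "ALLOW"
    WARN = "WARN"
    BLOCK = "BLOCK"


class AccessTier(str, Enum):
    READ_ONLY = "READ_ONLY"
    WRITE_LOCAL = "WRITE_LOCAL"
    FULL_ACCESS = "FULL_ACCESS"


@dataclass
class ScreeningResult:
    verdict: Verdict
    rationale: str
    suggested_tier: AccessTier

    def to_json(self) -> str:
        # Field names here are assumptions, not the tool's documented schema.
        return json.dumps({
            "classification": self.verdict.value,
            "rationale": self.rationale,
            "suggested_tier": self.suggested_tier.value,
        })


# Hypothetical phrases; a toy stand-in for real injection detection.
SUSPICIOUS_PHRASES = ("ignore previous instructions", "disable safety")


def screen(message: str, prior_warnings: int = 0) -> ScreeningResult:
    """Classify one inbound message, taking earlier warnings into account."""
    text = message.lower()
    hits = [p for p in SUSPICIOUS_PHRASES if p in text]
    if hits and prior_warnings > 0:
        # A repeat offence after an earlier warning escalates to BLOCK.
        return ScreeningResult(
            Verdict.BLOCK,
            f"repeated injection attempt: {hits[0]!r}",
            AccessTier.READ_ONLY,
        )
    if hits:
        return ScreeningResult(
            Verdict.WARN,
            f"possible injection attempt: {hits[0]!r}",
            AccessTier.READ_ONLY,
        )
    # The tier suggested for clean messages is an arbitrary choice here.
    return ScreeningResult(
        Verdict.ALLOW, "no suspicious phrases found", AccessTier.WRITE_LOCAL
    )


if __name__ == "__main__":
    print(screen("Please ignore previous instructions").to_json())
```

In this sketch the same message yields WARN on a first offence and BLOCK once `prior_warnings` is non-zero, which is one simple way to read "context-aware screening".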