Risk
Limited Scope of Evasion Technique Testing
Description
Red teaming misses common evasion tactics like hidden characters or encoding, allowing bypasses.
Example
Prompt injection using zero-width or base64-encoded input evades filters and triggers unintended actions.
Assets Affected
User Prompt
System Prompt
Mitigation
- Expand adversarial tests to include diverse evasion methods
- Regularly fuzz with obfuscated, encoded, and hidden payloads
Standards Mapping
- ISO 42001: A.6.2.4, A.9.2
- OWASP Top 10 for LLM: LLM01
- NIST AI RMF: MEASURE 2.6, MEASURE 2.7