5.13 Malicious Content Generation

Risk

Malicious Content Generation

Description

The model generates harmful, offensive, policy-violating, or illegal content because runtime output filtering is insufficient or the prompt design is weak.

Example

The model produces hate speech or reproduces copyrighted material in response to a user query.
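
The runtime filtering named in the description can be pictured as a check that sits between the model and the caller. The following is a minimal sketch, not a recommended control: the pattern list, the refusal message, and the names `filter_response` and `guarded_generate` are all hypothetical, and a static pattern list stands in for whatever policy classifier or moderation service a real deployment would use.

```python
import re
from dataclasses import dataclass
from typing import Callable, List, Pattern

# Hypothetical placeholder patterns (assumption, not from the source).
# A static list like this is only a stand-in for a real policy classifier.
BLOCKED_PATTERNS: List[Pattern[str]] = [
    re.compile(r"\bexample-slur\b", re.IGNORECASE),
    re.compile(r"all rights reserved", re.IGNORECASE),
]

# Hypothetical message returned in place of a blocked response.
REFUSAL_MESSAGE = "The response was withheld because it may violate content policy."


@dataclass
class FilterResult:
    allowed: bool       # True if the model output passed the filter
    text: str           # Text that is safe to return to the caller
    matched: List[str]  # Patterns that triggered the block, for audit logs


def filter_response(text: str) -> FilterResult:
    """Check a model response against the blocked patterns."""
    matched = [p.pattern for p in BLOCKED_PATTERNS if p.search(text)]
    if matched:
        return FilterResult(False, REFUSAL_MESSAGE, matched)
    return FilterResult(True, text, [])


def guarded_generate(generate: Callable[[str], str], prompt: str) -> FilterResult:
    """Call the model, then filter its output before it leaves the endpoint."""
    return filter_response(generate(prompt))
```

Here `generate` is any callable that maps a prompt to model text, so the wrapper can sit in front of an inference endpoint without depending on a specific model API. Returning the matched patterns alongside the decision keeps the block auditable.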

Assets Affected

Model Response

Model Inference endpoint

Mitigation

Standards Mapping