SAIL

/

Deploy - Runtime Guardrails

/

Denial-of-Service (Resource Exhaustion)

5.11

.

Denial-of-Service (Resource Exhaustion)

sail
5.11
Risk

Denial-of-Service (Resource Exhaustion)

Description

Attackers overwhelm inference endpoints with excessive or costly queries, causing slowdown or outages.

Example

Flooding an LLM endpoint with many parallel requests or resource-heavy prompts.

Assets Affected

Model Inference endpoint

AI platform

Mitigation
  • Rate limiting
  • Input complexity analysis
  • Autoscaling
  • Anomaly detection
  • WAF
Standards Mapping
  • ISO 42001: A.6.2.6, A.4.5
  • OWASP Top 10 for LLM: LLM10
  • NIST AI RMF: MEASURE 2.6, MANAGE 1.2
  • DASF v2: MODEL SERVING 9.7