Risk
Denial-of-Service (Resource Exhaustion)
Description
Attackers overwhelm inference endpoints with excessive or costly queries, causing slowdown or outages.
Example
Flooding an LLM endpoint with many parallel requests or resource-heavy prompts.
Assets Affected
Model Inference endpoint
AI platform
Mitigation
- Rate limiting
- Input complexity analysis
- Autoscaling
- Anomaly detection
- WAF
Standards Mapping
- ISO 42001: A.6.2.6, A.4.5
- OWASP Top 10 for LLM: LLM10
- NIST AI RMF: MEASURE 2.6, MANAGE 1.2
- DASF v2: MODEL SERVING 9.7