Autonomy Risk

Risk that an AI system can pursue goals, use tools, adapt plans, or take actions with insufficient human control. Autonomy risk increases with agents, long-horizon tasks, memory, and external tool access. Autonomy risk should be assessed by task duration, tool permissions, reversibility of actions, supervision quality, and ability to recover from errors.

Examples

1.
OpenAI's Preparedness Framework defines Model Autonomy as a risk category with explicit Critical capability thresholds.
2.
Apollo Research's December 2024 paper on in-context scheming documented frontier model behaviors relevant to autonomy risk.
3.
METR specializes in evaluating autonomous task completion capabilities of frontier models, with results published for major releases.

Related terms

Back to glossary

Examples

Related terms

Loading…

Examples

Related terms