Back to glossary
AI GLOSSARY
Capability Overhang
Safety, Alignment & Ethics
A situation where an AI system has latent capabilities that have not yet been discovered or activated, either because they have not been tested for, or because a small change in training or prompting could unlock significantly more powerful behavior. Capability overhangs are a safety concern because they mean a system's true capabilities may be substantially greater than its observed behavior suggests.
See also: AI safety, emergent behavior, AI alignment.