AI GLOSSARY

Capability Overhang

Safety, Alignment & Ethics

A situation in which an AI system has latent capabilities that have not yet been discovered or activated, either because no one has tested for them or because a small change in training or prompting could unlock significantly more powerful behavior. Capability overhangs are a safety concern because a system's true capabilities may be substantially greater than its observed behavior suggests.
See also: AI safety, emergent behavior, AI alignment.