AI GLOSSARY

Corrigibility

Safety, Alignment & Ethics

The property of an AI system that makes it amenable to correction, modification, or shutdown by its operators, even if doing so conflicts with the system's current objectives. A corrigible AI does not resist being turned off or having its goals changed, which is considered a desirable safety property, especially during the early stages of developing powerful AI systems.
See also: AI alignment, AI containment, control problem.

External reference