Back to glossaryExternal reference
AI GLOSSARY
Circuit
Research & Advanced Concepts
In mechanistic interpretability research, a circuit is a specific subgraph of a neural network, a collection of neurons and their connections, that implements a particular computation or behavior. Identifying circuits helps researchers understand how models process information internally, moving beyond treating neural networks as black boxes toward a more precise, mechanistic understanding of what a model is actually doing.
See also: mechanistic interpretability, interpretability, black-box model.