AI GLOSSARY

Circuit

Research & Advanced Concepts

In mechanistic interpretability research, a circuit is a specific subgraph of a neural network, a collection of neurons and their connections, that implements a particular computation or behavior. Identifying circuits helps researchers understand how models process information internally, moving beyond treating neural networks as black boxes toward a more precise, mechanistic understanding of what a model is actually doing.
See also: mechanistic interpretability, interpretability, black-box model.

External reference