Back to glossary
AI GLOSSARY
Inference Cost
Business & Product
The computational cost of running a trained AI model to generate outputs. Every user interaction triggers inference, and at scale these costs can become significant, making inference cost a key factor in decisions about which models to deploy, at what context length, and under what pricing structure.
See also: inference, FLOPS, hosted model.