AI GLOSSARY
Model Serving
Deployment & Infrastructure
The infrastructure and processes that make a trained model available to handle prediction requests from applications or users. Model serving involves packaging the model, exposing it via an API, managing compute resources such as CPU, GPU, and memory, and ensuring it can reliably handle the expected volume of requests.
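As a rough illustration of "exposing it via an API", the sketch below wraps a model behind a small HTTP endpoint using only the Python standard library. Everything named here is an assumption made for the example: the fixed linear scorer stands in for a real trained model, and the `/predict` path, the `features` field, and the `predict`, `handle_request`, and `serve` helpers are hypothetical. A production setup would typically add batching, authentication, monitoring, and scaling on top of something like this.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical stand-in for a trained model: a fixed linear scorer.
# A real deployment would load packaged model weights from disk instead.
WEIGHTS = [0.5, -0.25]
BIAS = 1.0


def predict(features):
    """Return a score for one feature vector."""
    if len(features) != len(WEIGHTS):
        raise ValueError(f"expected {len(WEIGHTS)} features, got {len(features)}")
    return BIAS + sum(w * x for w, x in zip(WEIGHTS, features))


def handle_request(body):
    """Turn a JSON request body into (status_code, response_dict)."""
    try:
        payload = json.loads(body)
        score = predict(payload["features"])
    except (ValueError, KeyError, TypeError) as exc:
        # Malformed JSON, a missing field, or a wrong shape is a client error.
        return 400, {"error": str(exc)}
    return 200, {"prediction": score}


class PredictHandler(BaseHTTPRequestHandler):
    """Serves POST /predict (an assumed route for this example)."""

    def do_POST(self):
        if self.path != "/predict":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        status, result = handle_request(self.rfile.read(length))
        data = json.dumps(result).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)


def serve(host="127.0.0.1", port=8000):
    """Block and answer prediction requests until interrupted."""
    HTTPServer((host, port), PredictHandler).serve_forever()
```

Calling `serve()` starts the endpoint locally, after which a client can POST a JSON body such as `{"features": [2.0, 4.0]}` to `/predict`. Keeping `handle_request` separate from the HTTP handler means the request logic can be checked without starting a server.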