Back to glossary

AI GLOSSARY

Model Serving

Deployment & Infrastructure

The infrastructure and processes that make a trained model available to handle prediction requests from applications or users. Model serving involves packaging the model, exposing it via an API, managing resources, and ensuring it can handle the expected volume of requests reliably.