Back to glossary
AI GLOSSARY
Model Compression
AI & Machine Learning
A collection of techniques, including pruning, quantization, and distillation, used to reduce the size and computational requirements of a model without significantly sacrificing its performance. Compressed models are faster, cheaper to run, and easier to deploy on devices with limited resources.