Back to glossary

AI GLOSSARY

Model Compression

AI & Machine Learning

A collection of techniques, including pruning, quantization, and distillation, used to reduce the size and computational requirements of a model without significantly sacrificing its performance. Compressed models are faster, cheaper to run, and easier to deploy on devices with limited resources.