Back to glossary

AI GLOSSARY

Bias (Data)

AI & Machine Learning

Systematic distortion in a dataset that causes a model trained on it to produce skewed or unrepresentative outputs. Data bias can enter at many points, through unrepresentative sampling, historically prejudiced labels, measurement errors, or simply the way data was collected and curated. Because a model can only learn from what it is shown, biased input reliably produces biased output, regardless of how well the model itself is built.
See also: algorithmic bias, bias mitigation, training data.

External reference