Back to glossary
AI GLOSSARY
Preference Tuning
Large Language Model (LLM) Terms
A training technique where a model is fine-tuned based on human preferences between pairs of outputs, teaching it to produce responses that people find more helpful, accurate, or appropriate. Preference tuning is a key step in making raw language models behave like useful assistants, and is closely related to reinforcement learning from human feedback.