Back to glossary

AI GLOSSARY

Streaming Output

Large Language Model (LLM) Terms

A delivery method where a language model's response is sent to the user token by token as it is generated, rather than waiting for the complete response before displaying anything. Streaming is why text from AI assistants appears to type itself out in real time, making interactions feel faster and more natural, and allowing users to start reading before generation is finished.