CAI & Generative Media

Context Window

AlsoContext LengthContext SizeToken Limit

The maximum amount of text an LLM can process at once—the model's working memory that limits how much it can 'see' in a conversation.

Context window is the maximum amount of text (measured in tokens) that a language model can consider at once when generating a response.

Size Matters

Larger windows enable:

Limitations:

Roughly: 1 token ≈ 0.75 words (English). A 100K context window holds ~75,000 words or a short novel.

When context exceeds the window: