context_window

Context Window

128K ChatGPT Context Window

“Larger context basically means you can write a larger text prompt, and get a larger and more detailed response back. So you could for example copy the text from multiple pages from a book (up to 300 pages, if the claims from the announcement are accurate), and then ask it to summarize the content, analyze, identify key points or themes, etc.” https://www.reddit.com/r/ChatGPT/comments/17pa61n/what_does_the_128k_context_window_mean_for/

“It also means that the AI will remember more of your long conversations. For example, let’s say you ask it to give you ideas for a story and it says, “This is a character named Paul whose brother is named Lenny.”” https://www.reddit.com/r/ChatGPT/comments/17pa61n/what_does_the_128k_context_window_mean_for/

“Then you keep asking for more and more details about the story, and it comes up with a story about Paul traveling to France and doing all of these interesting things. If you chat long enough and then ask it for the name of Paul’s brother, that first message could land outside of the context window, which means the AI will forget the answer it previously gave you. It might reply that Paul’s brother is named Dave, or it might even say that Paul doesn’t have a brother.” https://www.reddit.com/r/ChatGPT/comments/17pa61n/what_does_the_128k_context_window_mean_for/

“A longer context window allows you to have much longer conversations before it starts to “forget” things.” https://www.reddit.com/r/ChatGPT/comments/17pa61n/what_does_the_128k_context_window_mean_for/

The largest models, such as Google's Gemini 1.5, presented in February 2024, can have a context window sized up to 1 million (context window of 10 million was also “successfully tested”).

Snippet from Wikipedia: Context window

The context window of a large language model (LLM) is the maximum amount of text or other tokenized input available to the model at one time when generating output. It is usually measured in tokens, which are units produced by the model's tokenizer rather than words or characters. In practical terms, the context window is the material the model can "see" while producing a response; anything outside that window is not directly available unless it is summarized, retrieved, or provided again. A longer context window can allow a model to work with longer prompts, conversations, documents, codebases, or retrieved passages without first compressing or discarding as much information.

The practical size of context windows has increased rapidly as LLM systems have developed. Some models are limited by the sequence lengths used during training, while attention variants and positional-encoding methods can allow models to operate on longer sequences than those seen during training. By the mid-2020s, long-context systems had reported context windows ranging from hundreds of thousands to millions of tokens; Google researchers reported Gemini 1.5 evaluations on retrieval tasks at up to 10 million tokens.

A larger context window does not necessarily mean that a model can use the entire context equally well. In "Lost in the Middle", Liu et al. found that performance on long-context tasks was often worse when relevant information appeared in the middle of an input rather than near the beginning or end. Other benchmarks have assessed long- context capability using tasks that go beyond simple retrieval, including multi-document question answering, long-dialogue understanding, code repository understanding, and structured-data reasoning.

context_window.txt · Last modified: 2025/02/01 07:07 by 127.0.0.1

Donate Powered by PHP Valid HTML5 Valid CSS Driven by DokuWiki