GistGarden
Artificial Intelligence Difficulty 65/100

Attention

Spotlight on what matters most.

Attention monster
Spotlight on what matters most.
⚡ The 5-second answer

Attention is a mechanism that lets a model focus on the most relevant parts of input data when making predictions.

Explain like I'm five

Imagine you're reading a recipe and need to find how much sugar to add. Instead of reading every word again, you scan directly to the 'sugar' part and pay close attention to that number. Attention does the same for AI: it highlights the important pieces of information and ignores the rest.

Why it matters

Attention is the core of modern AI systems like ChatGPT and Google Translate, making them understand context and long sentences. Without it, models would get lost in long texts and miss crucial details.

Common misconception

Many think attention means the model is 'thinking' or 'aware' like a human. Actually, it's just a mathematical way to assign weights to different inputs—no consciousness or real focus involved.

Formal definition

Attention computes a weighted sum of values, where weights are derived from a compatibility function between a query and a set of keys. This allows the model to dynamically prioritize different parts of the input sequence, enabling it to capture long-range dependencies in data like text or images.