018-qkv-matrices

The Q, K, V Matrices

Source: https://arpitbhayani.me/blogs/qkv-matrices Date: 2025-11-26

At the core of the attention mechanism in LLMs are three matrices: Query, Key, and Value. These matrices are how transformers actually pay attention to different parts of the input. In this write-up, we will go through the construction of these matrices from the ground up.
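As a rough preview of where this is headed, here is a minimal sketch. It assumes the standard formulation, in which each of the three matrices is produced by multiplying the token embeddings by its own weight matrix. The names `X`, `W_q`, `W_k`, `W_v`, the sizes, and the random values are illustrative assumptions, not taken from the article; in a real model the weights are learned during training.

```python
import random

def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def random_matrix(rows, cols, rng):
    """Stand-in for learned weights or embeddings (hypothetical values)."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]

rng = random.Random(0)
seq_len, d_model, d_k = 4, 8, 3  # illustrative sizes, chosen arbitrarily

X = random_matrix(seq_len, d_model, rng)   # one embedding row per token
W_q = random_matrix(d_model, d_k, rng)     # query projection weights
W_k = random_matrix(d_model, d_k, rng)     # key projection weights
W_v = random_matrix(d_model, d_k, rng)     # value projection weights

# Each token's embedding is projected three different ways.
Q = matmul(X, W_q)
K = matmul(X, W_k)
V = matmul(X, W_v)

print(len(Q), len(Q[0]))  # rows = tokens, columns = projection size
```

Every token ends up with one row in each of `Q`, `K`, and `V`, so all three have shape `seq_len × d_k` in this sketch.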

Why Q, K, V Matrices Matter

When we read a sentence like “The cat sat on the mat because it was comfortable,” our brain automatically knows that “it” refers to “the mat” and not “the cat.” This is attention in action: our brain selectively focuses on the relevant words to understand the context.

The Intuition

Attention Pipeline

A Simple Example

The Weight Matrices

Constructing the Query matrix

Constructing the Key matrix

Constructing the Value matrix

Construction Pseudocode

Why Separate Weight Matrices

Impact of Chosen Dimension

Role of Matrices in Attention

The First Step

Footnote