Language & LLMs
What Is Cross-Attention?
Cross-attention is an attention mechanism where queries come from one sequence and keys and values come from a different sequence. It is commonly used in encoder-decoder models so the decoder can focus on relevant parts of the encoder's output. This is essential for tasks like machine translation.
Further reading
Read more about cross-attention — articles and blogs from around the web: