Core Concepts

What Is a Transformer in AI?

The transformer is a neural network architecture introduced in the 2017 paper “Attention Is All You Need.” Its attention mechanism weighs the relationships between all positions of an input at once, instead of stepping through them one at a time as earlier recurrent networks did. That parallelism let training scale to much larger models and datasets. Transformers are the foundation of today’s large language and multimodal models.
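To make the idea of weighing all parts of an input at once more concrete, here is a minimal sketch of scaled dot-product attention, the core operation described in the paper, written in plain Python. The function names (`softmax`, `attention`) and the toy input are illustrative assumptions, not part of any library. A real transformer would also apply learned projections to produce the queries, keys, and values, and would run several attention heads in parallel; both are omitted here.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def attention(queries, keys, values):
    """Scaled dot-product attention over lists of vectors (illustrative sketch)."""
    scale = math.sqrt(len(keys[0]))  # scale by the square root of the key dimension
    outputs = []
    for q in queries:
        # Score this position against every position in the input.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in keys]
        weights = softmax(scores)
        # The output is the weighted average of the value vectors.
        outputs.append([
            sum(w * v[j] for w, v in zip(weights, values))
            for j in range(len(values[0]))
        ])
    return outputs

# Toy input: three positions, each a 2-dimensional vector (made-up numbers).
x = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

# Self-attention: queries, keys, and values all come from the same input.
result = attention(x, x, x)
```

Each row of `result` is a blend of all three input vectors, with the blend weights set by how strongly that position's query matches each key. No position has to wait for an earlier one to be processed, which is why the computation parallelizes well.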


Further reading

Articles and blogs about the transformer model from around the web: