Core Concepts
What Is a Mixture of Experts (MoE)?
A mixture of experts (MoE) is a model design in which many specialized sub-networks (“experts”) exist, but only a few are activated for any given input by a routing mechanism. This lets a model have very large total capacity while using less computation per query, and it features in several recent high-performing models.
Further reading
Read more about Mixture of experts — articles and blogs from around the web: