Core Concepts
What Is Inference in AI?
Inference is the process of using a trained model to produce output — answering a prompt, classifying an image, or generating text — as opposed to training, which is how the model learns. Because popular models serve many users, the cost, speed, and energy of inference are major practical concerns.
Further reading
Read more about AI inference — articles and blogs from around the web: