Infrastructure & Agents

What Is Model Serving?

Model serving exposes a trained model behind an interface, usually an API, so software can request predictions. It handles concerns like scaling, latency, and reliability that arise when running models in production.

Further reading

Read more about model serving — articles and blogs from around the web: