Machine Learning & Training

What Is a Markov Decision Process?

A Markov decision process (MDP) is a mathematical framework for modeling sequential decision-making. It consists of states, actions, transition probabilities, and rewards, where the next state depends only on the current state and action. MDPs form the theoretical foundation of much of reinforcement learning.

Further reading

Read more about Markov decision process — articles and blogs from around the web: