A Markov Decision Process (MDP) is defined by a standard set…
A Markov Decision Process (MDP) is defined by a standard set of components that describe how an agent interacts with its environment. Which of the following is NOT a standard component of a Markov Decision Process?