AN ALGORITHM TO OBTAIN AN OPTIMAL STRATEGY FOR THE MARKOV DECISION PROCESSES, WITH PROBABILITY DISTRIBUTION FOR THE PLANNING HORIZON
Keywords:
MDPs, optimization, probabilities and decision making, operation researchAbstract
In this paper we formulate Markov Decision Processes with Random Horizon. We show the optimality equation for this problem, however there may not exist optimal stationary strategies. For the MDP (Markov–Decision–Process), with probability distribution for the planning horizon with infinite support, we show Turnpike Planning Horizon Theorem. We develop an algorithm obtaining an optimal first stage decision. We give some numerical examples.
Downloads
Download data is not yet available.
Downloads
Published
2023-06-07
How to Cite
John, G. E. (2023). AN ALGORITHM TO OBTAIN AN OPTIMAL STRATEGY FOR THE MARKOV DECISION PROCESSES, WITH PROBABILITY DISTRIBUTION FOR THE PLANNING HORIZON. Investigación Operacional, 31(2). Retrieved from https://revistas.uh.cu/invoperacional/article/view/6197
Issue
Section
Articles


