This paper considers the maximization of certain equivalent reward generated by a Markov decision process with constant risk sensitivity. First, value iteration is used to optimize possibly ...