Abstract: This paper investigates learning the linear quadratic regulator for Markov jump linear systems (MJLS). First, we propose an estimation method for state-action ...