International Journal of Systems Science, Год журнала: 2024, Номер unknown, С. 1 - 10
Опубликована: Авг. 18, 2024
In this paper, we investigate a continuous-time linear quadratic stochastic optimal control (LQSOC) problem in an infinite horizon, where diffusion and drift terms of the corresponding system depend on both state variables. light theory, LQSOC is reduced to solving generalised algebraic Riccati equation (GARE). With help existing model-based value iteration (VI) algorithm, propose two data-driven VI algorithms solve GARE. The first one relies transforming into deterministic then by data system. Consequently, algorithm does not need information coefficients has lower complexity. second directly uses generated system, thus it circumvents requirement all coefficients. We also provide convergence proofs these validate through simulation examples.
Язык: Английский