Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/119210
| Title: | Policy iteration reinforcement learning method for continuous-time linear-quadratic mean-field control problems | Authors: | Li, N Li, X Xu, ZQ |
Issue Date: | Apr-2025 | Source: | IEEE transactions on automatic control, Apr. 2025, v. 70, no. 4, p. 2690-2697 | Abstract: | In this article, we employ a policy iteration reinforcement learning (RL) method to study continuous-time linear–quadratic mean-field control problems in infinite horizon. The drift and diffusion terms in the dynamics involve the states, the controls, and their conditional expectations. We investigate the stabilizability and convergence of the RL algorithm using a Lyapunov recursion. Instead of solving a pair of coupled Riccati equations, the RL technique focuses on strengthening an auxiliary function and the cost functional as the objective functions and updating the new policy to compute the optimal control via state trajectories. A numerical example sheds light on the established theoretical results. | Keywords: | Linear–quadratic (LQ) problem Mean-field (MF) optimal problem Policy iteration Reinforcement learning (RL) |
Publisher: | Institute of Electrical and Electronics Engineers | Journal: | IEEE transactions on automatic control | ISSN: | 0018-9286 | EISSN: | 1558-2523 | DOI: | 10.1109/TAC.2024.3494656 | Rights: | © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication N. Li, X. Li and Z. Q. Xu, "Policy Iteration Reinforcement Learning Method for Continuous-Time Linear–Quadratic Mean-Field Control Problems," in IEEE Transactions on Automatic Control, vol. 70, no. 4, pp. 2690-2697, April 2025 is available at https://doi.org/10.1109/TAC.2024.3494656. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Li_Policy_Iteration_Reinforcement.pdf | Pre-Published version | 1.44 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



