Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/119210
PIRA download icon_1.1View/Download Full Text
Title: Policy iteration reinforcement learning method for continuous-time linear-quadratic mean-field control problems
Authors: Li, N
Li, X 
Xu, ZQ 
Issue Date: Apr-2025
Source: IEEE transactions on automatic control, Apr. 2025, v. 70, no. 4, p. 2690-2697
Abstract: In this article, we employ a policy iteration reinforcement learning (RL) method to study continuous-time linear–quadratic mean-field control problems in infinite horizon. The drift and diffusion terms in the dynamics involve the states, the controls, and their conditional expectations. We investigate the stabilizability and convergence of the RL algorithm using a Lyapunov recursion. Instead of solving a pair of coupled Riccati equations, the RL technique focuses on strengthening an auxiliary function and the cost functional as the objective functions and updating the new policy to compute the optimal control via state trajectories. A numerical example sheds light on the established theoretical results.
Keywords: Linear–quadratic (LQ) problem
Mean-field (MF) optimal problem
Policy iteration
Reinforcement learning (RL)
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE transactions on automatic control 
ISSN: 0018-9286
EISSN: 1558-2523
DOI: 10.1109/TAC.2024.3494656
Rights: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication N. Li, X. Li and Z. Q. Xu, "Policy Iteration Reinforcement Learning Method for Continuous-Time Linear–Quadratic Mean-Field Control Problems," in IEEE Transactions on Automatic Control, vol. 70, no. 4, pp. 2690-2697, April 2025 is available at https://doi.org/10.1109/TAC.2024.3494656.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Li_Policy_Iteration_Reinforcement.pdfPre-Published version1.44 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.