Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/114432
Title: Continuous time q-learning for mean-field control problems
Authors: Wei, X
Yu, X 
Issue Date: Feb-2025
Source: Applied mathematics and optimization, Feb. 2025, v. 91, no. 1, 10
Abstract: This paper studies the q-learning, recently coined as the continuous time counterpart of Q-learning by Jia and Zhou (J Mach Learn Res 24:1–61, 2023), for continuous time mean-field control problems in the setting of entropy-regularized reinforcement learning. In contrast to the single agent’s control problem in Jia and Zhou (J Mach Learn Res 24:1–61, 2023), we reveal that two different q-functions naturally arise in mean-field control problems: (i) the integrated q-function (denoted by q) as the first-order approximation of the integrated Q-function introduced in Gu et al. (Oper Res 71(4):1040–1054, 2023), which can be learnt by a weak martingale condition using all test policies; and (ii) the essential q-function (denoted by qe) that is employed in the policy improvement iterations. We show that two q-functions are related via an integral representation. Based on the weak martingale condition and our proposed searching method of test policies, some model-free learning algorithms are devised. In two examples, one in LQ control framework and one beyond LQ control framework, we can obtain the exact parameterization of the optimal value function and q-functions and illustrate our algorithms with simulation experiments.
Keywords: Continuous time reinforcement learning
Integrated q-function
Mean-field control
Test policies
Weak martingale characterization
Publisher: Springer New York LLC
Journal: Applied mathematics and optimization 
ISSN: 0095-4616
EISSN: 1432-0606
DOI: 10.1007/s00245-024-10205-7
Appears in Collections:Journal/Magazine Article

Open Access Information
Status embargoed access
Embargo End Date 2025-12-17
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.