Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118214
DC FieldValueLanguage
dc.contributorDepartment of Civil and Environmental Engineering-
dc.creatorYao, Z-
dc.creatorZhu, Q-
dc.creatorQin, P-
dc.creatorLuo, M-
dc.date.accessioned2026-03-23T07:41:31Z-
dc.date.available2026-03-23T07:41:31Z-
dc.identifier.issn0920-8542-
dc.identifier.urihttp://hdl.handle.net/10397/118214-
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.subjectDistributed consensusen_US
dc.subjectMulti-agent systemsen_US
dc.subjectReinforcement learningen_US
dc.subjectStochastic Markov jump systemsen_US
dc.titleHomotopic reinforcement learning for distributed consensus control of stochastic Markov jump multi-agent systemsen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume81-
dc.identifier.issue15-
dc.identifier.doi10.1007/s11227-025-07885-5-
dcterms.abstractThis paper investigates the optimized consensus problem for a class of stochastic Markov jump multi-agent systems, with particular attention to the high computational demands that arise in large-scale implementations. Firstly, an error system is constructed based on the consensus objective, and a min-max strategy is introduced to transform the consensus problem into an optimized control problem of the error system. Subsequently, a set of parallel coupled game Lyapunov equations are developed to design the consensus controller, whose solution naturally requires significant parallel computation resources. Furthermore, to address the challenges posed by unknown system dynamics and the difficulty of obtaining an initial stable controller, a novel model-free consensus control approach based on homotopic reinforcement learning is proposed. By collecting state and input data, the proposed method enables the online computation of closed-loop stable controllers and optimized consensus controllers in a scalable and distributed manner. Finally, a numerical example is presented to demonstrate the effectiveness of the proposed approach, highlighting its suitability for real-time implementation on high-performance computing platforms.-
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationJournal of supercomputing, Oct. 2025, v. 81, no. 15, 1446-
dcterms.isPartOfJournal of supercomputing-
dcterms.issued2025-10-
dc.identifier.scopus2-s2.0-105018669109-
dc.identifier.artn1446-
dc.description.validate202603 bcjz-
dc.description.oaNot applicableen_US
dc.identifier.SubFormIDG001297/2026-02en_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported by Youth Talent Project of Scientific Research Program of Hubei Provincial Department of Education under Grant Q20241809 and Doctoral Scientific Research Foundation of Hubei University of Automotive Technology under Grant BK202404.en_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2026-10-30en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 2026-10-30
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.