Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/115256
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineering-
dc.creatorLi, Gen_US
dc.creatorOr, SWen_US
dc.date.accessioned2025-09-17T03:46:41Z-
dc.date.available2025-09-17T03:46:41Z-
dc.identifier.issn0959-6526en_US
dc.identifier.urihttp://hdl.handle.net/10397/115256-
dc.language.isoenen_US
dc.publisherElsevier BVen_US
dc.subjectHybrid energy storage systemsen_US
dc.subjectMulti-agent deep reinforcement learningen_US
dc.subjectMulti-time scale energy managementen_US
dc.subjectPhotovoltaicsen_US
dc.subjectUrban rail transitsen_US
dc.titleMulti-agent deep reinforcement learning-based multi-time scale energy management of urban rail traction networks with distributed photovoltaic–regenerative braking hybrid energy storage systemsen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume466en_US
dc.identifier.doi10.1016/j.jclepro.2024.142842en_US
dcterms.abstractThe integration of photovoltaics (PVs), regenerative braking (RB) techniques, and energy storage devices has become crucial to promote energy conservation and emission reduction for a sustainable future of urban rail traction networks (URTNs). This paper proposes a tri-level multi-time scale energy management framework for the economic and low-carbon operation of URTNs with PV–RB hybrid energy storage systems (HESSs) based on multi-agent deep reinforcement learning (MADRL). A two-stage stochastic scheduling approach is developed to minimize daily operation and carbon trading costs at the upper level and correct day-ahead scheduling deviations against multi-source uncertainties at the middle level. A MADRL-based real-time energy management strategy is established to optimize the PV–RB power flow and promote its utilization by coordinating distributed HESSs at the lower level. The HESS control problem is formulated as a decentralized partially observable Markov decision process and solved by a multi-agent control algorithm based on monotonic value function factorization. A Copula-based spatio-temporal dependency model is devised to characterize the PV, passenger flow, and traction load uncertainties and generate daily URTN operation scenarios for enhancing day-ahead and intraday decisions. Comparative studies demonstrate the effectiveness of the proposed framework in terms of a cost reduction by 11.98% and a PV–RB energy utilization improvement by 13.94%.-
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationJournal of cleaner production, 10 Aug. 2024, v. 466, 142842en_US
dcterms.isPartOfJournal of cleaner productionen_US
dcterms.issued2024-08-10-
dc.identifier.scopus2-s2.0-85196006886-
dc.identifier.eissn1879-1786en_US
dc.identifier.artn142842en_US
dc.description.validate202509 bcch-
dc.identifier.FolderNumbera4037b-
dc.identifier.SubFormID51982-
dc.description.fundingSourceRGCen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported in part by the Research Grants Council of the HKSAR Government (Grant No. R5020-18), and in part by the Innovation and Technology Commission of the HKSAR Government to the Hong Kong Branch of National Rail Transit Electrification and Automation Engineering Technology Research Center (Grant No. K-BBY1).en_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2026-08-10en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 2026-08-10
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.