Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/116189
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.contributorInternational Centre of Urban Energy Nexusen_US
dc.contributorPolicy Research Centre for Innovation and Technologyen_US
dc.contributorResearch Institute for Smart Energyen_US
dc.contributorMainland Development Officeen_US
dc.creatorWang, Ren_US
dc.creatorBi, Xen_US
dc.creatorBu, Sen_US
dc.creatorTang, Zen_US
dc.date.accessioned2025-11-26T04:20:57Z-
dc.date.available2025-11-26T04:20:57Z-
dc.identifier.issn2162-237Xen_US
dc.identifier.urihttp://hdl.handle.net/10397/116189-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication R. Wang, X. Bi, S. Bu and Z. Tang, 'Deep Reinforcement Learning Approach for Dynamic Distribution Network Reconfiguration Based on Sequential Masking,' in IEEE Transactions on Neural Networks and Learning Systems, vol. 36, no. 10, pp. 19270-19284, Oct. 2025 is available at https://doi.org/10.1109/TNNLS.2025.3574208.en_US
dc.subjectDeep reinforcement learning (DRL)en_US
dc.subjectDynamic distribution network reconfiguration (DDNR)en_US
dc.subjectSequential maskingen_US
dc.subjectSoft actor critic (SAC)en_US
dc.titleDeep reinforcement learning approach for dynamic distribution network reconfiguration based on sequential maskingen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage19270en_US
dc.identifier.epage19284en_US
dc.identifier.volume36en_US
dc.identifier.issue10en_US
dc.identifier.doi10.1109/TNNLS.2025.3574208en_US
dcterms.abstractDynamic distribution network reconfiguration (DDNR) is a widely used technique for the secure and economic operation of power distribution networks (PDNs), especially in the presence of high-penetration renewable energy sources (RESs). DDNR is realized by controlling the on/off status of remotely controlled switches (RCSs) equipped at power lines in PDNs to optimize power flows. Thanks to the enhanced data availability of PDNs, data-driven solutions to DDNR, such as deep reinforcement learning (DRL), have gained growing attention recently. However, DDNR solves a sequence of combinatorial problems featuring a vast and sparse action space incurred by a so-called “radiality constraint,” which is highly challenging for DRLs to handle. Existing DRL methods are either unscalable to large-scale problems or potentially restrict optimality. Hence, we propose a sequential masking strategy to decompose its complex action space into a sequence of maskable sub-action spaces. A gated recurrent unit (GRU)-based agent and an adapted soft actor critic (SAC) algorithm are designed accordingly, producing a data-efficient, safety-guaranteed, and scalable DRL solution to the DDNR problem. Comprehensive comparisons with existing data-driven methods and model-based benchmarks are conducted via various case studies, demonstrating the advantages of the proposed method in both algorithmic performance and scalability.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE transactions on neural networks and learning systems, Oct. 2025, v. 36, no. 10, p. 19270-19284en_US
dcterms.isPartOfIEEE transactions on neural networks and learning systemsen_US
dcterms.issued2025-10-
dc.identifier.scopus2-s2.0-105007713620-
dc.identifier.eissn2162-2388en_US
dc.description.validate202511 bcjzen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.SubFormIDG000388/2025-07-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextPolyU for RISE Seed Project (Grant Number: U-CDC8); PolyU for PReCIT Seed Project (Grant Number: 1-CE16); PolyU for Intra-Faculty Interdisciplinary Project (Grant Number: 1-WZ4L); Beijing Normal-Hong Kong Baptist University for Start-up Fund (Grant Number: UICR0700116-25)en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Wang_Deep_Reinforcement_Learning.pdfPre-Published version4.73 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.