Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/115768
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Building Environment and Energy Engineeringen_US
dc.creatorWang, Sen_US
dc.creatorZhao, Hen_US
dc.creatorShu, Ten_US
dc.creatorPan, Zen_US
dc.creatorLiang, Gen_US
dc.creatorZhao, Jen_US
dc.date.accessioned2025-10-28T07:33:07Z-
dc.date.available2025-10-28T07:33:07Z-
dc.identifier.issn1949-3053en_US
dc.identifier.urihttp://hdl.handle.net/10397/115768-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication S. Wang, H. Zhao, T. Shu, Z. Pan, G. Liang and J. Zhao, 'Power Smoothing Control for Wind-Storage Integrated Systems With Hierarchical Safe Reinforcement Learning and Curriculum Learning,' in IEEE Transactions on Smart Grid, vol. 16, no. 6, pp. 4606-4619, Nov. 2025 is available at https://doi.org/10.1109/TSG.2025.3593340.en_US
dc.subjectCurriculum learningen_US
dc.subjectDeep reinforcement learningen_US
dc.subjectPower smoothing controlen_US
dc.subjectPrioritized experiment replayen_US
dc.subjectWind storage integrated systemsen_US
dc.titlePower smoothing control for wind-storage integrated systems with hierarchical safe reinforcement learning and curriculum learningen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage4606en_US
dc.identifier.epage4619en_US
dc.identifier.volume16en_US
dc.identifier.issue6en_US
dc.identifier.doi10.1109/TSG.2025.3593340en_US
dcterms.abstractAs the penetration of wind energy increases, the Wind Storage Integrated System (WSIS) has become a critical solution to ensure stable wind power output and maximize economic benefits. However, existing data-based power smoothing control strategies are hard to satisfy the power fluctuation constraint in a complex environment, presenting an inefficient coordinate performance. To address this problem, this paper proposes a novel Hierarchical Safe Deep Reinforcement Learning (HSDRL) control framework for WSIS. The control problem is first reformulated as two interconnected Constrained Markov Decision Processes, and the hierarchical primal-dual-based safe Deep Deterministic Policy Gradient algorithm is proposed to learn the optimal policy that ensures the power output constraint. Furthermore, the curriculum learning is designed and Constraint Violation Prioritized Experience Replay method is proposed to address the unstable convergence issues caused by imbalanced constraint violation and constraint satisfaction experience data. Last, a hierarchical shared feature neural network structure is designed to share the parameters of Q networks at hierarchies and increase learning efficiency. Simulation results in WindFarmSimulator validate the efficacy of the proposed control framework, demonstrating a 15.3% improvement in profit and a 46.0% reduction in fluctuation compared to existing methods.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE transactions on smart grid, Nov. 2025, v. 16, no. 6, p. 4606-4619en_US
dcterms.isPartOfIEEE transactions on smart griden_US
dcterms.issued2025-11-
dc.identifier.scopus2-s2.0-105012117574-
dc.identifier.eissn1949-3061en_US
dc.description.validate202510 bcjzen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.SubFormIDG000298/2025-08-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThis work was supported in part by the National Natural Science Foundation of China under Grant 72331009, Grant 72171206, and Grant 92270105; in part by the Shenzhen Key Lab of Crowd Intelligence Empowered Low-Carbon Energy Network under Grant ZDSYS20220606100601002; in part by the Guangdong Power Grid Company under Grant GDKJXM20231024; in part by the PolyU Direct under Grant P0047700, Grant P0043885, and Grant P0051105; in part by the Shenzhen Natural Science Fund; in part by the Stable Support Plan Program under Grant GXWD20231128112434001; and in part by the Shenzhen Institute of Artificial Intelligence and Robotics for Society. Paper no. TSG-02300-2024en_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Wang_Power_Smoothing_Control.pdfPre-Published version2.28 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.