Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/108689
PIRA download icon_1.1View/Download Full Text
Title: Performance assessment and comparative analysis of photovoltaic-battery system scheduling in an existing zero-energy house based on reinforcement learning control
Authors: Xu, W
Li, Y 
He, G
Xu, Y
Gao, W
Issue Date: Jul-2023
Source: Energies, July 2023, v. 16, no. 13, 4844
Abstract: The development of distributed renewable energy resources and smart energy management are efficient approaches to decarbonizing building energy systems. Reinforcement learning (RL) is a data-driven control algorithm that trains a large amount of data to learn control policy. However, this learning process generally presents low learning efficiency using real-world stochastic data. To address this challenge, this study proposes a model-based RL approach to optimize the operation of existing zero-energy houses considering PV generation consumption and energy costs. The model-based approach takes advantage of the inner understanding of the system dynamics; this knowledge improves the learning efficiency. A reward function is designed considering the physical constraints of battery storage, photovoltaic (PV) production feed-in profit, and energy cost. Measured data of a zero-energy house are used to train and test the proposed RL agent control, including Q-learning, deep Q network (DQN), and deep deterministic policy gradient (DDPG) agents. The results show that the proposed RL agents can achieve fast convergence during the training process. In comparison with the rule-based strategy, test cases verify the cost-effectiveness performances of proposed RL approaches in scheduling operations of the hybrid energy system under different scenarios. The comparative analysis of test periods shows that the DQN agent presents better energy cost-saving performances than Q-learning while the Q-learning agent presents more flexible action control of the battery with the fluctuation of real-time electricity prices. The DDPG algorithm can achieve the highest PV self-consumption ratio, 49.4%, and the self-sufficiency ratio reaches 36.7%. The DDPG algorithm outperforms rule-based operation by 7.2% for energy cost during test periods.
Keywords: Battery storage
Energy cost
PV consumption
Reinforcement learning
Reward design
Publisher: MDPI AG
Journal: Energies 
EISSN: 1996-1073
DOI: 10.3390/en16134844
Rights: © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
The following publication Xu W, Li Y, He G, Xu Y, Gao W. Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control. Energies. 2023; 16(13):4844 is available at https://doi.org/10.3390/en16134844.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
energies-16-04844.pdf8.89 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

86
Citations as of Nov 10, 2025

Downloads

20
Citations as of Nov 10, 2025

SCOPUSTM   
Citations

12
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

3
Citations as of Feb 13, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.