Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/108689
| Title: | Performance assessment and comparative analysis of photovoltaic-battery system scheduling in an existing zero-energy house based on reinforcement learning control |
| Authors: | Xu, W; Li, Y; He, G; Xu, Y; Gao, W |
| Issue Date: | Jul-2023 |
| Source: | Energies, July 2023, v. 16, no. 13, 4844 |
| Abstract: | The development of distributed renewable energy resources and smart energy management are efficient approaches to decarbonizing building energy systems. Reinforcement learning (RL) is a data-driven control algorithm that learns a control policy by training on large amounts of data. However, this learning process generally suffers from low learning efficiency when using real-world stochastic data. To address this challenge, this study proposes a model-based RL approach to optimize the operation of existing zero-energy houses, considering photovoltaic (PV) generation consumption and energy costs. The model-based approach takes advantage of knowledge of the system dynamics, which improves learning efficiency. A reward function is designed that accounts for the physical constraints of battery storage, PV production feed-in profit, and energy cost. Measured data from a zero-energy house are used to train and test the proposed RL agents, including Q-learning, deep Q-network (DQN), and deep deterministic policy gradient (DDPG) agents. The results show that the proposed RL agents achieve fast convergence during training. In comparison with a rule-based strategy, the test cases verify the cost-effectiveness of the proposed RL approaches in scheduling the operation of the hybrid energy system under different scenarios. The comparative analysis of the test periods shows that the DQN agent achieves better energy cost savings than Q-learning, while the Q-learning agent controls the battery more flexibly in response to fluctuating real-time electricity prices. The DDPG algorithm achieves the highest PV self-consumption ratio, 49.4%, with a self-sufficiency ratio of 36.7%, and outperforms the rule-based operation by 7.2% in energy cost over the test periods. |
| Keywords: | Battery storage; Energy cost; PV consumption; Reinforcement learning; Reward design |
| Publisher: | MDPI AG |
| Journal: | Energies |
| EISSN: | 1996-1073 |
| DOI: | 10.3390/en16134844 |
| Rights: | © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). The following publication Xu W, Li Y, He G, Xu Y, Gao W. Performance Assessment and Comparative Analysis of Photovoltaic-Battery System Scheduling in an Existing Zero-Energy House Based on Reinforcement Learning Control. Energies. 2023; 16(13):4844 is available at https://doi.org/10.3390/en16134844. |
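The abstract reports a PV self-consumption ratio of 49.4% and a self-sufficiency ratio of 36.7% for the DDPG agent. This record does not reproduce the paper's exact definitions, but the sketch below shows how these two metrics are commonly computed from interval energy data; the function name, variable names, sample values, and the approximation of on-site PV use as generation minus grid export are illustrative assumptions, not taken from the publication.

```python
# Illustrative sketch (not from the paper): commonly used definitions of the
# PV self-consumption ratio (SCR) and self-sufficiency ratio (SSR) for a
# PV-battery system, computed from interval time-series energy data.

import numpy as np

def pv_ratios(pv_gen, load, grid_export):
    """Return (self_consumption_ratio, self_sufficiency_ratio).

    pv_gen      -- PV generation per interval [kWh]
    load        -- household load per interval [kWh]
    grid_export -- PV energy exported to the grid per interval [kWh]

    PV energy used on site (directly or via the battery) is approximated
    here as generation minus export; conversion losses are neglected.
    """
    pv_gen = np.asarray(pv_gen, dtype=float)
    load = np.asarray(load, dtype=float)
    grid_export = np.asarray(grid_export, dtype=float)

    pv_self_used = np.clip(pv_gen - grid_export, 0.0, None)

    scr = pv_self_used.sum() / pv_gen.sum()  # share of PV output consumed on site
    ssr = pv_self_used.sum() / load.sum()    # share of load covered by PV
    return scr, ssr

# Example with made-up hourly data (kWh):
if __name__ == "__main__":
    scr, ssr = pv_ratios(pv_gen=[0.0, 1.2, 2.5, 1.8, 0.3],
                         load=[0.8, 0.9, 1.1, 1.0, 0.9],
                         grid_export=[0.0, 0.4, 1.5, 0.9, 0.0])
    print(f"SCR = {scr:.1%}, SSR = {ssr:.1%}")
```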
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| energies-16-04844.pdf | | 8.89 MB | Adobe PDF | View/Open |
Page views: 86 (as of Nov 10, 2025)
Downloads: 20 (as of Nov 10, 2025)
SCOPUS™ Citations: 12 (as of Dec 19, 2025)
Web of Science™ Citations: 3 (as of Feb 13, 2025)
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



