Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/112192
PIRA download icon_1.1View/Download Full Text
Title: Active learning concerning sampling cost for enhancing AI-enabled building energy system modeling
Authors: Li, A 
Xiao, F 
Xiao, ZW 
Yan, R
Li, AB
Lv, Y
Su, B
Issue Date: Dec-2024
Source: Advances in applied energy, Dec. 2024, v. 16, 100189
Abstract: Machine learning is widely recognized as a promising data-driven modeling technique for the model-based control and optimization of building energy systems. However, the generalizability of data-driven models often faces significant challenges, as the available training data from building operations usually only covers a limited range of working conditions. Active learning can proactively test unseen and informative working conditions to enrich the training set by adding new data samples, leading to improved generalization performance of data-driven models. A novel distance and information density-based sample strategy is developed that accounts for the real-time status of building operation and outdoor environment. Based on Mahalanobis distance, this strategy determines the sampling value of an unlabeled sample (unseen working condition) by assessing its similarity to both the training samples and other unlabeled samples. As collecting sufficiently representative samples can be difficult, costly, and time-consuming, a distance-based sampling cost metric is proposed to compare the efficiency of different sampling methods, considering the detrimental effects of the actively sampling process on the normal operation of building energy systems. This paper presents a comprehensive and in-depth comparison of five active learning methods, including one incorporating the distance-based sampling strategy, by conducting data experiments on the data collected from the cooling towers of a real high-rise building. The results show that active learning can effectively identify informative data samples and improve the generalization performance of data-driven models. The research outcomes are valuable for enhancing AI- enabled data-driven modeling of building energy systems with substantial decreases in costs on data sampling.
Keywords: Building energy system
Building control
Model-based optimization
Data-driven modeling
Machine learning
Active learning
Publisher: Elsevier Ltd
Journal: Advances in applied energy 
EISSN: 2666-7924
DOI: 10.1016/j.adapen.2024.100189
Rights: © 2024 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/bync-nd/4.0/).
The following publication Li, A., Xiao, F., Xiao, Z., Yan, R., Li, A., Lv, Y., & Su, B. (2024). Active learning concerning sampling cost for enhancing AI-enabled building energy system modeling. Advances in Applied Energy, 16, 100189 is available at https://doi.org/10.1016/j.adapen.2024.100189.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
1-s2.0-S2666792424000271-main.pdf8.83 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

WEB OF SCIENCETM
Citations

1
Citations as of Apr 3, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.