Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/108208
PIRA download icon_1.1View/Download Full Text
Title: GEIN : an interpretable benchmarking framework towards all building types based on machine learning
Authors: Jin, X 
Xiao, F 
Zhang, C 
Li, A 
Issue Date: 1-Apr-2022
Source: Energy and buildings, 1 Apr. 2022, v. 260, 111909
Abstract: Building energy performance benchmarking is adopted by many countries in the world as an effective tool to reduce energy consumption at city or country level. Machine learning holds a lot of promise for quickly and correctly predicting energy consumption from massive data, thereby it’s suitable for large-scale performance assessment. However, there is a severe problem of data imbalance in building types in many datasets. Due to the lack of samples for some types of buildings, unfavorable results, such as low accuracy of prediction, are produced sometimes. Meanwhile, the poor interpretability of machine learning models makes it difficult to promote the benchmarking frameworks based on machine learning. Therefore, this study proposed a novel machine learning based building performance benchmarking framework with improved generalization and interpretability. A reliable and convenient data augmentation approach was established to overcome the data imbalance problem while avoiding the overfitting problem. Superior results were obtained in case studies using three city-level open-source building datasets from two different countries. A complete rating framework was also proposed, with proper explanations of results at sample level. The performance of this rating framework was verified by comparing with other data-driven benchmarking frameworks. Moreover, the importance of variables was quantified and ranked, which can be a significant reference for data collectors and publishers. The results demonstrated that data augmentation can effectively solve the problem of data imbalance, which enables the universality of machine learning based benchmarking on all types of buildings. And the proposed GEIN benchmarking framework can also effectively address the issues of interpretability.
Keywords: Data augmentation
EUI prediction
GEIN
Interpretable building energy benchmarking
Machine learning
Publisher: Elsevier BV
Journal: Energy and buildings 
ISSN: 0378-7788
EISSN: 1872-6178
DOI: 10.1016/j.enbuild.2022.111909
Rights: © 2022 Elsevier B.V. All rights reserved.
© 2022. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/
The following publication Jin, X., Xiao, F., Zhang, C., & Li, A. (2022). GEIN: An interpretable benchmarking framework towards all building types based on machine learning. Energy and Buildings, 260, 111909 is available at https://doi.org/10.1016/j.enbuild.2022.111909.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Jin_GEIN_Interpretable_Benchmarking.pdfPre-Published version2.96 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

118
Citations as of Nov 10, 2025

Downloads

82
Citations as of Nov 10, 2025

WEB OF SCIENCETM
Citations

31
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.