Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/117695
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Building Environment and Energy Engineering | en_US |
| dc.creator | Gao, DC | en_US |
| dc.creator | Zhang, X | en_US |
| dc.creator | Zhang, Y | en_US |
| dc.creator | Gao, Y | en_US |
| dc.creator | Zou, W | en_US |
| dc.creator | Shan, K | en_US |
| dc.date.accessioned | 2026-02-27T02:37:48Z | - |
| dc.date.available | 2026-02-27T02:37:48Z | - |
| dc.identifier.issn | 0360-5442 | en_US |
| dc.identifier.uri | http://hdl.handle.net/10397/117695 | - |
| dc.language.iso | en | en_US |
| dc.publisher | Pergamon Press | en_US |
| dc.subject | Building load prediction | en_US |
| dc.subject | Dataset preprocessing | en_US |
| dc.subject | Feature extraction | en_US |
| dc.subject | Machine learning | en_US |
| dc.title | Dataset preprocessing and optimization for machine learning-based building load prediction : a review | en_US |
| dc.type | Journal/Magazine Article | en_US |
| dc.identifier.volume | 344 | en_US |
| dc.identifier.doi | 10.1016/j.energy.2026.139992 | en_US |
| dcterms.abstract | Machine learning is widely recognized as a crucial solution in the field of building load forecasting. Datasets play a pivotal role in machine learning, as their quality and characteristics directly determine the performance of prediction models. However, current research predominantly focuses on algorithms or summarizing the overall prediction process, often neglecting the processing and optimization of datasets for algorithmic models. This oversight limits the accuracy and reliability of prediction models in practical applications. To address this issue, this review centers on building load forecasting based on machine learning and organizes the existing research on enhancing model performance from the perspective of dataset processing and optimization. Firstly, it reviews the different building types and their corresponding data characteristics within existing building load forecasting. It constructs an enhanced framework by focusing on dimensions like feature optimization and sample data optimization. Secondly, this review comprehensively analyzes the complex impact of data characteristics from various building types on prediction accuracy. It delves deeply into data acquisition sources and data quality defects, presenting corresponding solutions. Furthermore, it evaluates multiple methods for feature selection and extraction, along with the applicable scenarios for different methods. Finally, this review discusses systematic strategies for sample data optimization. This study rigorously examines machine learning aspects in preprocessing datasets for building load prediction, offering systematic support and multidimensional insights for dataset-based algorithmic model optimization. | en_US |
| dcterms.accessRights | embargoed access | en_US |
| dcterms.bibliographicCitation | Energy, 1 Feb. 2026, v. 344, 139992 | en_US |
| dcterms.isPartOf | Energy | en_US |
| dcterms.issued | 2026-02-01 | - |
| dc.identifier.scopus | 2-s2.0-105027429205 | - |
| dc.identifier.eissn | 1873-6785 | en_US |
| dc.identifier.artn | 139992 | en_US |
| dc.description.validate | 202602 bchy | en_US |
| dc.description.oa | Not applicable | en_US |
| dc.identifier.SubFormID | G001039/2026-02 | - |
| dc.description.fundingSource | Others | en_US |
| dc.description.fundingText | The research work presented in this paper is supported by a grant of Shenzhen Science and Technology Program (No. JCYJ20240813151049063 ), and a grant of the National Natural Science Foundation of China (No. 52278133 ). | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.date.embargo | 2028-02-01 | en_US |
| dc.description.oaCategory | Green (AAM) | en_US |
| Appears in Collections: | Journal/Magazine Article | |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



