Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/117695
DC FieldValueLanguage
dc.contributorDepartment of Building Environment and Energy Engineeringen_US
dc.creatorGao, DCen_US
dc.creatorZhang, Xen_US
dc.creatorZhang, Yen_US
dc.creatorGao, Yen_US
dc.creatorZou, Wen_US
dc.creatorShan, Ken_US
dc.date.accessioned2026-02-27T02:37:48Z-
dc.date.available2026-02-27T02:37:48Z-
dc.identifier.issn0360-5442en_US
dc.identifier.urihttp://hdl.handle.net/10397/117695-
dc.language.isoenen_US
dc.publisherPergamon Pressen_US
dc.subjectBuilding load predictionen_US
dc.subjectDataset preprocessingen_US
dc.subjectFeature extractionen_US
dc.subjectMachine learningen_US
dc.titleDataset preprocessing and optimization for machine learning-based building load prediction : a reviewen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume344en_US
dc.identifier.doi10.1016/j.energy.2026.139992en_US
dcterms.abstractMachine learning is widely recognized as a crucial solution in the field of building load forecasting. Datasets play a pivotal role in machine learning, as their quality and characteristics directly determine the performance of prediction models. However, current research predominantly focuses on algorithms or summarizing the overall prediction process, often neglecting the processing and optimization of datasets for algorithmic models. This oversight limits the accuracy and reliability of prediction models in practical applications. To address this issue, this review centers on building load forecasting based on machine learning and organizes the existing research on enhancing model performance from the perspective of dataset processing and optimization. Firstly, it reviews the different building types and their corresponding data characteristics within existing building load forecasting. It constructs an enhanced framework by focusing on dimensions like feature optimization and sample data optimization. Secondly, this review comprehensively analyzes the complex impact of data characteristics from various building types on prediction accuracy. It delves deeply into data acquisition sources and data quality defects, presenting corresponding solutions. Furthermore, it evaluates multiple methods for feature selection and extraction, along with the applicable scenarios for different methods. Finally, this review discusses systematic strategies for sample data optimization. This study rigorously examines machine learning aspects in preprocessing datasets for building load prediction, offering systematic support and multidimensional insights for dataset-based algorithmic model optimization.en_US
dcterms.accessRightsembargoed accessen_US
dcterms.bibliographicCitationEnergy, 1 Feb. 2026, v. 344, 139992en_US
dcterms.isPartOfEnergyen_US
dcterms.issued2026-02-01-
dc.identifier.scopus2-s2.0-105027429205-
dc.identifier.eissn1873-6785en_US
dc.identifier.artn139992en_US
dc.description.validate202602 bchyen_US
dc.description.oaNot applicableen_US
dc.identifier.SubFormIDG001039/2026-02-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextThe research work presented in this paper is supported by a grant of Shenzhen Science and Technology Program (No. JCYJ20240813151049063 ), and a grant of the National Natural Science Foundation of China (No. 52278133 ).en_US
dc.description.pubStatusPublisheden_US
dc.date.embargo2028-02-01en_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Open Access Information
Status embargoed access
Embargo End Date 2028-02-01
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.