Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/104114
PIRA download icon_1.1View/Download Full Text
Title: Insights into ensemble learning-based data-driven model for safety-related property of chemical substances
Authors: Wang, Z
Wen, H
Su, Y
Shen, W
Ren, J 
Ma, Y
Li, J
Issue Date: 2-Feb-2022
Source: Chemical engineering science, 2 Feb. 2022, v. 248, pt. A, 117219
Abstract: Risk assessment relying on characteristics of chemicals in process industries can prevent accidents caused by flammable and combustible liquids and gases. Whereas its application is limited by the lack of safety-related properties for abundant chemicals of interest, which promotes the demand for accurate predictive models to evaluate inherent safety implications of chemicals. In this research, staking-based ensemble learning is comprehensively investigated on safety-related properties to assist the risk assessment. Based on molecular structure-based features, individual and ensemble models are built and compared using heterogeneous machine learning (ML) methods. The systematic ensemble learning workflow is deployed by a case on flash points of chemical substances. Several representative ML methods including multiple linear regression, extreme learning machine, feedforward neural network, and support vector machine are taken into consideration. As it turns out, ensemble models exhibit improved predictive accuracy than standard individual ML models, indicating the effectiveness of ensemble learning on improving model performance. Moreover, extremal evaluations with existing models as well as internal analyses against functional group-based organic compound families and structural feature-based data-driven categories are carried out to identify model reliability. Ensemble learning is demonstrated as an effective approach for high-performance predictive modeling in safety-related risk assessments.
Keywords: Flash point
Machine learning
Molecular feature
Predictive modeling
Publisher: Elsevier Ltd
Journal: Chemical engineering science 
ISSN: 0009-2509
EISSN: 1873-4405
DOI: 10.1016/j.ces.2021.117219
Rights: © 2021 Elsevier Ltd. All rights reserved.
© 2021. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/
The following publication Wang, Z., Wen, H., Su, Y., Shen, W., Ren, J., Ma, Y., & Li, J. (2022). Insights into ensemble learning-based data-driven model for safety-related property of chemical substances. Chemical Engineering Science, 248, 117219 is available at https://doi.org/10.1016/j.ces.2021.117219.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Wang_Insights_Ensemble_Learning_based.pdfPre-Published version1.98 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

104
Last Week
3
Last month
Citations as of Nov 30, 2025

Downloads

71
Citations as of Nov 30, 2025

SCOPUSTM   
Citations

34
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

31
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.