Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/104394
PIRA download icon_1.1View/Download Full Text
Title: A novel unambiguous strategy of molecular feature extraction in machine learning assisted predictive models for environmental properties
Authors: Wang, Z
Su, Y
Jin, S
Shen, W
Ren, J 
Zhang, X
Clark, JH
Issue Date: 21-Jun-2020
Source: Green chemistry, 21 June 2020, v. 22, no. 12, p. 3867-3876
Abstract: Environmental properties of compounds provide significant information in treating organic pollutants, which drives the chemical process and environmental science toward eco-friendly technology. Traditional group contribution methods play an important role in property estimations, whereas various disadvantages emerge in their applications, such as scattered predicted values for certain groups of compounds. In order to address such issues, an extraction strategy for molecular features is proposed in this research, which is characterized by interpretability and discriminating power with regard to isomers. Based on the Henry's law constant data of organic compounds in water, we developed a hybrid predictive model that integrates the proposed strategy in conjunction with a neural network framework. The structure of the predictive model is optimized using cross-validation and grid search to improve its robustness. Moreover, the predictive model is improved by introducing the plane of best fit descriptor as input and adopting k-means clustering in sampling. In contrast with reported models in the literature, the developed predictive model demonstrates improved generality, higher accuracy, and fewer molecular features used in its development.
Publisher: Royal Society of Chemistry
Journal: Green chemistry 
ISSN: 1463-9262
EISSN: 1463-9270
DOI: 10.1039/d0gc01122c
Rights: This journal is © The Royal Society of Chemistry 2020
The following publication Wang, Z., Su, Y., Jin, S., Shen, W., Ren, J., Zhang, X., & Clark, J. H. (2020). A novel unambiguous strategy of molecular feature extraction in machine learning assisted predictive models for environmental properties. Green Chemistry, 22(12), 3867–3876 is available at https://doi.org/10.1039/d0gc01122c.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Ren_Novel_Unambiguous_Strategy.pdfPre-Published version806.42 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

90
Last Week
1
Last month
Citations as of Dec 21, 2025

Downloads

54
Citations as of Dec 21, 2025

SCOPUSTM   
Citations

39
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

35
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.