Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/97798
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorSchool of Optometryen_US
dc.creatorWin, ZMen_US
dc.creatorCheong, AMYen_US
dc.creatorHopkins, WSen_US
dc.date.accessioned2023-03-22T07:50:54Z-
dc.date.available2023-03-22T07:50:54Z-
dc.identifier.issn1549-9596en_US
dc.identifier.urihttp://hdl.handle.net/10397/97798-
dc.language.isoenen_US
dc.publisherAmerican Chemical Societyen_US
dc.rights© 2023 American Chemical Societyen_US
dc.rightsThis document is the Accepted Manuscript version of a Published Work that appeared in final form in Journal of chemical information and modeling, copyright © American Chemical Society after peer review and technical editing by the publisher. To access the final edited and published work see https://doi.org/10.1021/acs.jcim.2c01373.en_US
dc.titleUsing machine learning to predict partition coefficient (Log P) and distribution coefficient (Log D) with molecular descriptors and liquid chromatography retention timeen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage1906en_US
dc.identifier.epage1913en_US
dc.identifier.volume63en_US
dc.identifier.issue7en_US
dc.identifier.doi10.1021/acs.jcim.2c01373en_US
dcterms.abstractDuring preclinical evaluations of drug candidates, several physicochemical (p-chem) properties are measured and employed as metrics to estimate drug efficacy in vivo. Two such p-chem properties are the octanol–water partition coefficient, Log P, and distribution coefficient, Log D, which are useful in estimating the distribution of drugs within the body. Log P and Log D are traditionally measured using the shake-flask method and high-performance liquid chromatography. However, it is challenging to measure these properties for species that are very hydrophobic (or hydrophilic) owing to the very low equilibrium concentrations partitioned into octanol (or aqueous) phases. Moreover, the shake-flask method is relatively time-consuming and can require multistep dilutions as the range of analyte concentrations can differ by several orders of magnitude. Here, we circumvent these limitations by using machine learning (ML) to correlate Log P and Log D with liquid chromatography (LC) retention time (RT). Predictive models based on four ML algorithms, which used molecular descriptors and LC RTs as features, were extensively tested and compared. The inclusion of RT as an additional descriptor improves model performance (MAE = 0.366 and R2 = 0.89), and Shapley additive explanations analysis indicates that RT has the highest impact on model accuracy.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationJournal of chemical information and modeling, 10 Apr. 2023, v. 63, no. 7, p. 1906-1913en_US
dcterms.isPartOfJournal of chemical information and modelingen_US
dcterms.issued2023-04-10-
dc.identifier.pmid36926888-
dc.identifier.eissn1549-960Xen_US
dc.description.validate202303 bcwwen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumbera1962-
dc.identifier.SubFormID46210-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextthe Government of the Hong Kong Special Administrative Region & InnoHKen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Win_Using_Machine_Learning.pdfPre-Published version1.92 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

93
Citations as of May 11, 2025

Downloads

263
Citations as of May 11, 2025

SCOPUSTM   
Citations

17
Citations as of Jun 5, 2025

WEB OF SCIENCETM
Citations

17
Citations as of Jun 5, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.