Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/115073
PIRA download icon_1.1View/Download Full Text
Title: GraphATC : advancing multilevel and multi-label anatomical therapeutic chemical classification via atom-level graph learning
Authors: Zhang, W 
Tian, Q
Cao, Y
Fan, W 
Jiang, D
Wang, Y
Li, Q 
Wei, XY 
Issue Date: Mar-2025
Source: Briefings in bioinformatics, Mar. 2025, v. 26, no. 2, bbaf194
Abstract: The accurate categorization of compounds within the anatomical therapeutic chemical (ATC) system is fundamental for drug development and fundamental research. Although this area has garnered significant research focus for over a decade, the majority of prior studies have concentrated solely on the Level 1 labels defined by the World Health Organization (WHO), neglecting the labels of the remaining four levels. This narrow focus fails to address the true nature of the task as a multilevel, multi-label classification challenge. Moreover, existing benchmarks like Chen-2012 and ATC-SMILES have become outdated, lacking the incorporation of new drugs or updated properties of existing ones that have emerged in recent years and have been integrated into the WHO ATC system. To tackle these shortcomings, we present a comprehensive approach in this paper. Firstly, we systematically cleanse and enhance the drug dataset, expanding it to encompass all five levels through a rigorous cross-resource validation process involving KEGG, PubChem, ChEMBL, ChemSpider, and ChemicalBook. This effort culminates in the creation of a novel benchmark termed ATC-GRAPH. Secondly, we extend the classification task to encompass Level 2 and introduce graph-based learning techniques to provide more accurate representations of drug molecular structures. This approach not only facilitates the modeling of Polymers, Macromolecules, and Multi-Component drugs more precisely but also enhances the overall fidelity of the classification process. The efficacy of our proposed framework is validated through extensive experiments, establishing a new state-of-the-art methodology. To facilitate the replication of this study, we have made the benchmark dataset, source code, and web server openly accessible.
Keywords: Anatomical therapeutic chemical
Graph learning
Macromolecules
Molecular structures
Multi-component drugs
Polymers
Publisher: Oxford University Press
Journal: Briefings in bioinformatics 
ISSN: 1467-5463
EISSN: 1477-4054
DOI: 10.1093/bib/bbaf194
Rights: © The Author(s) 2025. Published by Oxford University Press.
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
The following publication Wengyu Zhang, Qi Tian, Yi Cao, Wenqi Fan, Dongmei Jiang, Yaowei Wang, Qing Li, Xiao-Yong Wei, GraphATC: advancing multilevel and multi-label anatomical therapeutic chemical classification via atom-level graph learning, Briefings in Bioinformatics, Volume 26, Issue 2, March 2025, bbaf194 is available at https://doi.org/10.1093/bib/bbaf194.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
bbaf194.pdf2.56 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.