Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/61859
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electronic and Information Engineering-
dc.creatorWan, S-
dc.creatorMak, MW-
dc.creatorKung, SY-
dc.date.accessioned2016-12-19T08:57:31Z-
dc.date.available2016-12-19T08:57:31Z-
dc.identifier.urihttp://hdl.handle.net/10397/61859-
dc.language.isoenen_US
dc.publisherBioMed Centralen_US
dc.rights© 2016 Wan et al. Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in anymedium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commonslicense, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.en_US
dc.rightsThe following publication Wan, S., Mak, M. W., & Kung, S. Y. (2016). Sparse regressions for predicting and interpreting subcellular localization of multi-label proteins. BMC Bioinformatics, 17, 97, 1-17 is available at https://dx.doi.org/10.1186/s12859-016-0940-xen_US
dc.titleSparse regressions for predicting and interpreting subcellular localization of multi-label proteinsen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.epage17-
dc.identifier.volume17-
dc.identifier.doi10.1186/s12859-016-0940-x-
dcterms.abstractBackground: Predicting protein subcellular localization is indispensable for inferring protein functions. Recent studies have been focusing on predicting not only single-location proteins, but also multi-location proteins. Almost all of the high performing predictors proposed recently use gene ontology (GO) terms to construct feature vectors for classification. Despite their high performance, their prediction decisions are difficult to interpret because of the large number of GO terms involved.-
dcterms.abstractResults: This paper proposes using sparse regressions to exploit GO information for both predicting and interpreting subcellular localization of single- and multi-location proteins. Specifically, we compared two multi-label sparse regression algorithms, namely multi-label LASSO (mLASSO) and multi-label elastic net (mEN), for large-scale predictions of protein subcellular localization. Both algorithms can yield sparse and interpretable solutions. By using the one-vs-rest strategy, mLASSO and mEN identified 87 and 429 out of more than 8,000 GO terms, respectively, which play essential roles in determining subcellular localization. More interestingly, many of the GO terms selected by mEN are from the biological process and molecular function categories, suggesting that the GO terms of these categories also play vital roles in the prediction. With these essential GO terms, not only where a protein locates can be decided, but also why it resides there can be revealed.-
dcterms.abstractConclusions: Experimental results show that the output of both mEN and mLASSO are interpretable and they perform significantly better than existing state-of-the-art predictors. Moreover, mEN selects more features and performs better than mLASSO on a stringent human benchmark dataset. For readers' convenience, an online server called SpaPredictor for both mLASSO and mEN is available at http://bioinfo.eie.polyu.edu.hk/SpaPredictorServer/.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationBMC bioinformatics, 2016, v. 17, 97, p. 1-17-
dcterms.isPartOfBMC bioinformatics-
dcterms.issued2016-
dc.identifier.isiWOS:000370775000002-
dc.identifier.scopus2-s2.0-84977480600-
dc.identifier.pmid26911432-
dc.identifier.eissn1471-2105-
dc.identifier.artn97-
dc.identifier.rosgroupid2015002418-
dc.description.ros2015-2016 > Academic research: refereed > Publication in refereed journal-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_IR/PIRAen_US
dc.description.pubStatusPublisheden_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Wan_Sparse_Regressions_Predicting.pdf2.67 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

150
Last Week
2
Last month
Citations as of Apr 14, 2024

Downloads

109
Citations as of Apr 14, 2024

SCOPUSTM   
Citations

24
Last Week
0
Last month
Citations as of Apr 4, 2024

WEB OF SCIENCETM
Citations

23
Last Week
0
Last month
Citations as of Apr 18, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.