Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/16759
PIRA download icon_1.1View/Download Full Text
Title: HybridGO-Loc : Mining hybrid features on gene ontology for predicting subcellular localization of multi-location proteins
Authors: Wan, S 
Mak, MW 
Kung, SY
Issue Date: 2014
Source: PLoS one, 2014, v. 9, no. 3, e89545
Abstract: Protein subcellular localization prediction, as an essential step to elucidate the functions in vivo of proteins and identify drugs targets, has been extensively studied in previous decades. Instead of only determining subcellular localization of single-label proteins, recent studies have focused on predicting both single- and multi-location proteins. Computational methods based on Gene Ontology (GO) have been demonstrated to be superior to methods based on other features. However, existing GO-based methods focus on the occurrences of GO terms and disregard their relationships. This paper proposes a multi-label subcellular-localization predictor, namely HybridGO-Loc, that leverages not only the GO term occurrences but also the inter-term relationships. This is achieved by hybridizing the GO frequencies of occurrences and the semantic similarity between GO terms. Given a protein, a set of GO terms are retrieved by searching against the gene ontology database, using the accession numbers of homologous proteins obtained via BLAST search as the keys. The frequency of GO occurrences and semantic similarity (SS) between GO terms are used to formulate frequency vectors and semantic similarity vectors, respectively, which are subsequently hybridized to construct fusion vectors. An adaptive-decision based multi-label support vector machine (SVM) classifier is proposed to classify the fusion vectors. Experimental results based on recent benchmark datasets and a new dataset containing novel proteins show that the proposed hybrid-feature predictor significantly outperforms predictors based on individual GO features as well as other state-of-the-art predictors. For readers' convenience, the HybridGO-Loc server, which is for predicting virus or plant proteins, is available online at http://bioinfo.eie.polyu.edu.hk/ HybridGoServer/.
Publisher: Public Library of Science
Journal: PLoS one 
EISSN: 1932-6203
DOI: 10.1371/journal.pone.0089545
Rights: © 2014 Wan et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
The following publication: Wan S, Mak M-W, Kung S-Y (2014) HybridGO-Loc: Mining Hybrid Features on Gene Ontology for Predicting Subcellular Localization of Multi-Location Proteins. PLoS ONE 9(3): e89545 is available at https://doi.org/10.1371/journal.pone.0089545
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Wan_HybridGO-Loc_Mining.PDF976.24 kBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

111
Last Week
1
Last month
Citations as of Apr 21, 2024

Downloads

93
Citations as of Apr 21, 2024

SCOPUSTM   
Citations

55
Last Week
0
Last month
1
Citations as of Apr 19, 2024

WEB OF SCIENCETM
Citations

66
Last Week
0
Last month
0
Citations as of Apr 18, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.