Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/103685
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorSchool of Nursing-
dc.creatorWang, Gen_US
dc.creatorDeng, Zen_US
dc.creatorChoi, KSen_US
dc.date.accessioned2024-01-02T03:09:59Z-
dc.date.available2024-01-02T03:09:59Z-
dc.identifier.issn2168-2194en_US
dc.identifier.urihttp://hdl.handle.net/10397/103685-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights© 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication G. Wang, Z. Deng and K. -S. Choi, "Tackling Missing Data in Community Health Studies Using Additive LS-SVM Classifier," in IEEE Journal of Biomedical and Health Informatics, vol. 22, no. 2, pp. 579-587, March 2018 is available at https://doi.org/10.1109/JBHI.2016.2634587.en_US
dc.subjectCommunity healthen_US
dc.subjectMissing dataen_US
dc.subjectPredictive modelsen_US
dc.subjectQuality of life (QOL)en_US
dc.subjectSupport vector machine (SVM)en_US
dc.titleTackling missing data in community health studies using additive LS-SVM classifieren_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage579en_US
dc.identifier.epage587en_US
dc.identifier.volume22en_US
dc.identifier.issue2en_US
dc.identifier.doi10.1109/JBHI.2016.2634587en_US
dcterms.abstractMissing data is a common issue in community health and epidemiological studies. Direct removal of samples with missing data can lead to reduced sample size and information bias, which deteriorates the significance of the results. While data imputation methods are available to deal with missing data, they are limited in performance and could introduce noises into the dataset. Instead of data imputation, a novel method based on additive least square support vector machine (LS-SVM) is proposed in this paper for predictive modeling when the input features of the model contain missing data. The method also determines simultaneously the influence of the features with missing values on the classification accuracy using the fast leave-one-out cross-validation strategy. The performance of the method is evaluated by applying it to predict the quality of life (QOL) of elderly people using health data collected in the community. The dataset involves demographics, socioeconomic status, health history, and the outcomes of health assessments of 444 community-dwelling elderly people, with 5% to 60% of data missing in some of the input features. The QOL is measured using a standard questionnaire of the World Health Organization. Results show that the proposed method outperforms four conventional methods for handling missing data - case deletion, feature deletion, mean imputation, and K-nearest neighbor imputation, with the average QOL prediction accuracy reaching 0.7418. It is potentially a promising technique for tackling missing data in community health research and other applications.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationIEEE journal of biomedical and health informatics, Mar. 2018, v. 22, no. 2, p. 579-587en_US
dcterms.isPartOfIEEE journal of biomedical and health informaticsen_US
dcterms.issued2018-03-
dc.identifier.scopus2-s2.0-85043263006-
dc.identifier.pmid27925597-
dc.identifier.eissn2168-2208en_US
dc.description.validate202312 bckw-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberSN-0371-
dc.description.fundingSourceOthersen_US
dc.description.fundingTextTai Hung Fai Charitable Fundation; YC Yu Scholarship for Centre for Smart Healthen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS6826069-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Wang_Tackling_Missing_Data.pdfPre-Published version1.32 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

70
Last Week
5
Last month
Citations as of Dec 21, 2025

Downloads

91
Citations as of Dec 21, 2025

SCOPUSTM   
Citations

36
Citations as of Aug 22, 2025

WEB OF SCIENCETM
Citations

23
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.