Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/80855
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computingen_US
dc.contributorSchool of Designen_US
dc.creatorJiang, Yen_US
dc.creatorChi, Zen_US
dc.date.accessioned2019-06-27T06:36:08Z-
dc.date.available2019-06-27T06:36:08Z-
dc.identifier.issn2076-3417en_US
dc.identifier.urihttp://hdl.handle.net/10397/80855-
dc.language.isoenen_US
dc.publisherMolecular Diversity Preservation International (MDPI)en_US
dc.rights© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Jiang Y, Chi Z. A CNN Model for Human Parsing Based on Capacity Optimization. Applied Sciences. 2019; 9(7):1330 is available at https://doi.org/10.3390/app9071330en_US
dc.subjectCapacity optimizationen_US
dc.subjectComputational efficiencyen_US
dc.subjectDepth-estimationen_US
dc.subjectHuman parsingen_US
dc.titleA CNN model for human parsing based on capacity optimizationen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.volume9en_US
dc.identifier.issue7en_US
dc.identifier.doi10.3390/app9071330en_US
dcterms.abstractAlthough a state-of-the-art performance has been achieved in pixel-specific tasks, such as saliency prediction and depth estimation, convolutional neural networks (CNNs) still perform unsatisfactorily in human parsing where semantic information of detailed regions needs to be perceived under the influences of variations in viewpoints, poses, and occlusions. In this paper, we propose to improve the robustness of human parsing modules by introducing a depth-estimation module. A novel scheme is proposed for the integration of a depth-estimation module and a human-parsing module. The robustness of the overall model is improved with the automatically obtained depth labels. As another major concern, the computational efficiency is also discussed. Our proposed human parsing module with 24 layers can achieve a similar performance as the baseline CNN model with over 100 layers. The number of parameters in the overall model is less than that in the baseline model. Furthermore, we propose to reduce the computational burden by replacing a conventional CNN layer with a stack of simplified sub-layers to further reduce the overall number of trainable parameters. Experimental results show that the integration of two modules contributes to the improvement of human parsing without additional human labeling. The proposed model outperforms the benchmark solutions and the capacity of our model is better matched to the complexity of the task.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationApplied sciences, 2019, v. 9, no. 7, 1330en_US
dcterms.isPartOfApplied sciencesen_US
dcterms.issued2019-
dc.identifier.scopus2-s2.0-85064090276-
dc.identifier.artn1330en_US
dc.description.validate201906 bcmaen_US
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_IR/PIRAen_US
dc.description.pubStatusPublisheden_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Jiang_CNN_model_human.pdf2.22 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

76
Last Week
2
Last month
Citations as of Apr 21, 2024

Downloads

68
Citations as of Apr 21, 2024

SCOPUSTM   
Citations

5
Citations as of Apr 26, 2024

WEB OF SCIENCETM
Citations

2
Citations as of Apr 25, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.