Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/89010
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Building and Real Estate-
dc.contributorResearch Institute for Sustainable Urban Development-
dc.creatorJiang, SL-
dc.creatorLi, G-
dc.creatorYao, W-
dc.creatorHong, ZH-
dc.creatorKuc, TY-
dc.date.accessioned2021-01-15T07:14:47Z-
dc.date.available2021-01-15T07:14:47Z-
dc.identifier.issn1682-1750-
dc.identifier.urihttp://hdl.handle.net/10397/89010-
dc.description2020 24th ISPRS Congress - Technical Commission II, 31 August - 2 September 2020en_US
dc.language.isoenen_US
dc.publisherCopernicus GmbHen_US
dc.rights© Author(s) 2020. This work is distributed under the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Jiang, S. L., Li, G., Yao, W., Hong, Z. H., and Kuc, T. Y.: DUAL PYRAMIDS ENCODER-DECODER NETWORK FOR SEMANTIC SEGMENTATION IN GROUND AND AERIAL VIEW IMAGES, Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B2-2020, 605–610, is available at https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-605-2020, 2020en_US
dc.subjectAerial and ground view imageen_US
dc.subjectConvolution neural networken_US
dc.subjectEncoder-Decoder networken_US
dc.subjectSemantic segmentationen_US
dc.titleDual pyramids encoder-decoder network for semantic segmentation in ground and aerial view imagesen_US
dc.typeConference Paperen_US
dc.identifier.spage605-
dc.identifier.epage610-
dc.identifier.volume43-
dc.identifier.issueB2-
dc.identifier.doi10.5194/isprs-archives-XLIII-B2-2020-605-2020-
dcterms.abstractSemantic segmentation is a fundamental research task in computer vision, which intends to assign a certain category to every pixel. Currently, most existing methods only utilize the deepest feature map for decoding, while high-level features get inevitably lost during the procedure of down-sampling. In the decoder section, transposed convolution or bilinear interpolation was widely used to restore the size of the encoded feature map; however, few optimizations are applied during up-sampling process which is detrimental to the performance for grouping and classification. In this work, we proposed a dual pyramids encoder-decoder deep neural network (DPEDNet) to tackle the above issues. The first pyramid integrated and encoded multi-resolution features through sequentially stacked merging, and the second pyramid decoded the features through dense atrous convolution with chained up-sampling. Without post-processing and multi-scale testing, the proposed network has achieved state-of-the-art performances on two challenging benchmark image datasets for both ground and aerial view scenes.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationInternational archives of the photogrammetry, remote sensing and spatial information sciences, 2020, v. 43, no. B2, p. 605-610-
dcterms.isPartOfInternational archives of the photogrammetry, remote sensing and spatial information sciences-
dcterms.issued2020-
dc.identifier.scopus2-s2.0-85091116429-
dc.relation.conferenceISPRS Congress on Technical Commission-
dc.identifier.eissn2194-9034-
dc.description.validate202101 bcrc-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Scopus/WOSen_US
dc.description.pubStatusPublisheden_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Jiang_Dual_pyramids_encoder-decoder.pdf1.11 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

103
Last Week
3
Last month
Citations as of May 5, 2024

Downloads

41
Citations as of May 5, 2024

SCOPUSTM   
Citations

1
Citations as of Apr 26, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.