Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/89010
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Building and Real Estate-
dc.contributorResearch Institute for Sustainable Urban Development-
dc.creatorJiang, SL-
dc.creatorLi, G-
dc.creatorYao, W-
dc.creatorHong, ZH-
dc.creatorKuc, TY-
dc.date.accessioned2021-01-15T07:14:47Z-
dc.date.available2021-01-15T07:14:47Z-
dc.identifier.issn1682-1750-
dc.identifier.urihttp://hdl.handle.net/10397/89010-
dc.description2020 24th ISPRS Congress - Technical Commission II, 31 August - 2 September 2020en_US
dc.language.isoenen_US
dc.publisherCopernicus GmbHen_US
dc.rights© Author(s) 2020. This work is distributed under the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/).en_US
dc.rightsThe following publication Jiang, S. L., Li, G., Yao, W., Hong, Z. H., and Kuc, T. Y.: DUAL PYRAMIDS ENCODER-DECODER NETWORK FOR SEMANTIC SEGMENTATION IN GROUND AND AERIAL VIEW IMAGES, Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci., XLIII-B2-2020, 605–610, is available at https://doi.org/10.5194/isprs-archives-XLIII-B2-2020-605-2020, 2020en_US
dc.subjectAerial and ground view imageen_US
dc.subjectConvolution neural networken_US
dc.subjectEncoder-Decoder networken_US
dc.subjectSemantic segmentationen_US
dc.titleDual pyramids encoder-decoder network for semantic segmentation in ground and aerial view imagesen_US
dc.typeConference Paperen_US
dc.identifier.spage605-
dc.identifier.epage610-
dc.identifier.volume43-
dc.identifier.issueB2-
dc.identifier.doi10.5194/isprs-archives-XLIII-B2-2020-605-2020-
dcterms.abstractSemantic segmentation is a fundamental research task in computer vision, which intends to assign a certain category to every pixel. Currently, most existing methods only utilize the deepest feature map for decoding, while high-level features get inevitably lost during the procedure of down-sampling. In the decoder section, transposed convolution or bilinear interpolation was widely used to restore the size of the encoded feature map; however, few optimizations are applied during up-sampling process which is detrimental to the performance for grouping and classification. In this work, we proposed a dual pyramids encoder-decoder deep neural network (DPEDNet) to tackle the above issues. The first pyramid integrated and encoded multi-resolution features through sequentially stacked merging, and the second pyramid decoded the features through dense atrous convolution with chained up-sampling. Without post-processing and multi-scale testing, the proposed network has achieved state-of-the-art performances on two challenging benchmark image datasets for both ground and aerial view scenes.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationInternational archives of the photogrammetry, remote sensing and spatial information sciences, 2020, v. 43, no. B2, p. 605-610-
dcterms.isPartOfInternational archives of the photogrammetry, remote sensing and spatial information sciences-
dcterms.issued2020-
dc.identifier.scopus2-s2.0-85091116429-
dc.relation.conferenceISPRS Congress on Technical Commission-
dc.identifier.eissn2194-9034-
dc.description.validate202101 bcrc-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Scopus/WOSen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Jiang_Dual_pyramids_encoder-decoder.pdf1.11 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

211
Last Week
31
Last month
Citations as of Feb 9, 2026

Downloads

74
Citations as of Feb 9, 2026

SCOPUSTM   
Citations

1
Citations as of May 8, 2026

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.