Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106937
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Electrical and Electronic Engineeringen_US
dc.creatorSong, Hen_US
dc.creatorWang, Wen_US
dc.creatorZhao, Sen_US
dc.creatorShen, Jen_US
dc.creatorLam, KMen_US
dc.date.accessioned2024-06-07T00:58:59Z-
dc.date.available2024-06-07T00:58:59Z-
dc.identifier.isbn978-3-030-01251-9en_US
dc.identifier.isbn978-3-030-01252-6 (eBook)en_US
dc.identifier.issn0302-9743en_US
dc.identifier.urihttp://hdl.handle.net/10397/106937-
dc.description15th European Conference on Computer Vision, ECCV 2018, Munich, Germany, September 8-14, 2018en_US
dc.language.isoenen_US
dc.publisherSpringeren_US
dc.rights© Springer Nature Switzerland AG 2018en_US
dc.rightsThis version of the proceeding paper has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use(https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms), but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/978-3-030-01252-6_44.en_US
dc.titlePyramid dilated deeper ConvLSTM for video salient object detectionen_US
dc.typeConference Paperen_US
dc.identifier.spage744en_US
dc.identifier.epage760en_US
dc.identifier.volume11215en_US
dc.identifier.doi10.1007/978-3-030-01252-6_44en_US
dcterms.abstractThis paper proposes a fast video salient object detection model, based on a novel recurrent network architecture, named Pyramid Dilated Bidirectional ConvLSTM (PDB-ConvLSTM). A Pyramid Dilated Convolution (PDC) module is first designed for simultaneously extracting spatial features at multiple scales. These spatial features are then concatenated and fed into an extended Deeper Bidirectional ConvLSTM (DB-ConvLSTM) to learn spatiotemporal information. Forward and backward ConvLSTM units are placed in two layers and connected in a cascaded way, encouraging information flow between the bi-directional streams and leading to deeper feature extraction. We further augment DB-ConvLSTM with a PDC-like structure, by adopting several dilated DB-ConvLSTMs to extract multi-scale spatiotemporal information. Extensive experimental results show that our method outperforms previous video saliency models in a large margin, with a real-time speed of 20 fps on a single GPU. With unsupervised video object segmentation as an example application, the proposed model (with a CRF-based post-process) achieves state-of-the-art results on two popular benchmarks, well demonstrating its superior performance and high applicability.en_US
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationLecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics), 2018, v. 11215, p. 744-760en_US
dcterms.isPartOfLecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics)en_US
dcterms.issued2018-
dc.identifier.scopus2-s2.0-85055123293-
dc.relation.conferenceEuropean Conference on Computer Vision [ECCV]en_US
dc.identifier.eissn1611-3349en_US
dc.description.validate202405 bcchen_US
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberEIE-0471-
dc.description.fundingSourceSelf-fundeden_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS20083678-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Lam_Pyramid_Dilated_Deeper.pdfPre-Published version2.48 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

84
Last Week
4
Last month
Citations as of Nov 9, 2025

Downloads

60
Citations as of Nov 9, 2025

SCOPUSTM   
Citations

114
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

366
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.