Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/106937
| Title: | Pyramid dilated deeper ConvLSTM for video salient object detection | Authors: | Song, H Wang, W Zhao, S Shen, J Lam, KM |
Issue Date: | 2018 | Source: | Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics), 2018, v. 11215, p. 744-760 | Abstract: | This paper proposes a fast video salient object detection model, based on a novel recurrent network architecture, named Pyramid Dilated Bidirectional ConvLSTM (PDB-ConvLSTM). A Pyramid Dilated Convolution (PDC) module is first designed for simultaneously extracting spatial features at multiple scales. These spatial features are then concatenated and fed into an extended Deeper Bidirectional ConvLSTM (DB-ConvLSTM) to learn spatiotemporal information. Forward and backward ConvLSTM units are placed in two layers and connected in a cascaded way, encouraging information flow between the bi-directional streams and leading to deeper feature extraction. We further augment DB-ConvLSTM with a PDC-like structure, by adopting several dilated DB-ConvLSTMs to extract multi-scale spatiotemporal information. Extensive experimental results show that our method outperforms previous video saliency models in a large margin, with a real-time speed of 20 fps on a single GPU. With unsupervised video object segmentation as an example application, the proposed model (with a CRF-based post-process) achieves state-of-the-art results on two popular benchmarks, well demonstrating its superior performance and high applicability. | Publisher: | Springer | Journal: | Lecture notes in computer science (including subseries Lecture notes in artificial intelligence and lecture notes in bioinformatics) | ISBN: | 978-3-030-01251-9 978-3-030-01252-6 (eBook) |
ISSN: | 0302-9743 | EISSN: | 1611-3349 | DOI: | 10.1007/978-3-030-01252-6_44 | Description: | 15th European Conference on Computer Vision, ECCV 2018, Munich, Germany, September 8-14, 2018 | Rights: | © Springer Nature Switzerland AG 2018 This version of the proceeding paper has been accepted for publication, after peer review (when applicable) and is subject to Springer Nature’s AM terms of use(https://www.springernature.com/gp/open-research/policies/accepted-manuscript-terms), but is not the Version of Record and does not reflect post-acceptance improvements, or any corrections. The Version of Record is available online at: http://dx.doi.org/10.1007/978-3-030-01252-6_44. |
| Appears in Collections: | Conference Paper |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Lam_Pyramid_Dilated_Deeper.pdf | Pre-Published version | 2.48 MB | Adobe PDF | View/Open |
Page views
147
Last Week
13
13
Last month
Citations as of Apr 12, 2026
Downloads
139
Citations as of Apr 12, 2026
SCOPUSTM
Citations
119
Citations as of May 8, 2026
WEB OF SCIENCETM
Citations
383
Citations as of Apr 23, 2026
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



