Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/92053
PIRA download icon_1.1View/Download Full Text
Title: VNLSTM-PoseNet : a novel deep ConvNet for real-time 6-DOF camera relocalization in urban streets
Authors: Li, M
Qin, JY
Li, DR
Chen, RZ
Liao, X 
Guo, BX
Issue Date: 2021
Source: Geo-spatial information science (地球空间信息科学学报), 2021, v. 24, no. 3, p. 422-437
Abstract: Image-based relocalization is a renewed interest in outdoor environments, because it is an important problem with many applications. PoseNet introduces Convolutional Neural Network (CNN) for the first time to realize the real-time camera pose solution based on a single image. In order to solve the problem of precision and robustness of PoseNet and its improved algorithms in complex environment, this paper proposes and implements a new visual relocation method based on deep convolutional neural networks (VNLSTM-PoseNet). Firstly, this method directly resizes the input image without cropping to increase the receptive field of the training image. Then, the image and the corresponding pose labels are put into the improved Long Short-Term Memory based (LSTM-based) PoseNet network for training and the network is optimized by the Nadam optimizer. Finally, the trained network is used for image localization to obtain the camera pose. Experimental results on outdoor public datasets show our VNLSTM-PoseNet can lead to drastic improvements in relocalization performance compared to existing state-of-the-art CNN-based methods.
Keywords: Camera relocalization
Pose regression
Deep convnet
RGB image
Camera pose
Publisher: Taylor & Francis Asia Pacific (Singapore)
Journal: Geo-spatial information science (地球空间信息科学学报) 
ISSN: 1009-5020
EISSN: 1993-5153
DOI: 10.1080/10095020.2021.1960779
Rights: © 2021 THE AUTHORS. Published by Elsevier BV on behalf of Faculty of Engineering, Ain Shams University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
The following publication Li, M., Qin, J., Li, D., Chen, R., Liao, X., & Guo, B. (2021). VNLSTM-PoseNet: A novel deep ConvNet for real-time 6-DOF camera relocalization in urban streets. Geo-Spatial Information Science, 24(3), 422-437 is available at https://doi.org/10.1080/10095020.2021.1960779
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Li_novel_deep_ConvNet.pdf9.24 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

81
Last Week
0
Last month
Citations as of May 11, 2025

Downloads

72
Citations as of May 11, 2025

SCOPUSTM   
Citations

16
Citations as of Jun 21, 2024

WEB OF SCIENCETM
Citations

14
Citations as of Jun 5, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.