Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118681
PIRA download icon_1.1View/Download Full Text
Title: Spatial–temporal diffusion model for underwater scene reconstruction with application to AUV navigation
Authors: Zhang, Z 
Fang, L
Yan, Z
Chen, T
Wang, B 
Wen, CY 
Issue Date: Dec-2025
Source: IEEE/ASME transactions on mechatronics, Dec. 2025, v. 30, no. 6, p. 4142-4153
Abstract: autonomous underwater vehicles (AUVs) have been extensively utilized in subsea exploration and surveying. However, accurately perceiving the surrounding environment remains a significant challenge for AUVs due to the complexities of subsea terrains. To address this issue, we propose a novel generative scene reconstruction method to enhance AUVs’ perception capabilities. Our method is primarily designed for reconstructing dense subsea terrain from 3-D multibeam echosounder data. We leverage local diffusion and denoising strategies to reconstruct complete subsea terrain at the scene scale directly, without requiring normalization from point clouds. Considering the motion dynamics of AUVs and the overlap between consecutive sonar frames, we introduce a spatial–temporal attention mechanism to aggregate features from consecutive point clouds and guide the reconstruction process as a condition. Then, the reconstructed point cloud is utilized for probabilistic terrain modeling through Bayesian updating, enabling path planning. Experiments conducted on simulation and real-world datasets demonstrate that our method can generate more accurate and complete terrain maps. Furthermore, path planning based on our reconstruction method achieves the shortest and smoothest motion path, further validating that our reconstruction method can provide more complete perception information for AUV navigation.
Keywords: Autonomous underwater vehicle (AUV)
Diffusion model
Scene reconstruction
Subsea terrain perception
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE/ASME transactions on mechatronics 
ISSN: 1083-4435
EISSN: 1941-014X
DOI: 10.1109/TMECH.2025.3600436
Research Data: https://github.com/sam-zyzhang/SonarPC-Diff
Rights: © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication Z. Zhang, L. Fang, Z. Yan, T. Chen, B. Wang and C. -y. Wen, 'Spatial–Temporal Diffusion Model for Underwater Scene Reconstruction With Application to AUV Navigation,' in IEEE/ASME Transactions on Mechatronics, vol. 30, no. 6, pp. 4142-4153, Dec. 2025 is available at https://doi.org/10.1109/TMECH.2025.3600436.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Zhang_Spatial_Temporal_Diffusion.pdfPre-Published version5.59 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.