Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/109174
PIRA download icon_1.1View/Download Full Text
Title: Deep learning framework with Local Sparse Transformer for construction worker detection in 3D with LiDAR
Authors: Zhang, M 
Wang, L 
Han, S 
Wang, S 
Li, H 
Issue Date: 1-Oct-2024
Source: Computer-aided civil and infrastructure engineering, 1 Oct. 2024, v. 39, no. 19, p. 2990-3007
Abstract: Autonomous equipment is playing an increasingly important role in construction tasks. It is essential to equip autonomous equipment with powerful 3D detection capability to avoid accidents and inefficiency. However, there is limited research within the construction field that has extended detection to 3D. To this end, this study develops a light detection and ranging (LiDAR)-based deep-learning model for the 3D detection of workers on construction sites. The proposed model adopts a voxel-based anchor-free 3D object detection paradigm. To enhance the feature extraction capability for tough detection tasks, a novel Transformer-based block is proposed, where the multi-head self-attention is applied in local grid regions. The detection model integrates the Transformer blocks with 3D sparse convolution to extract wide and local features while pruning redundant features in modified downsampling layers. To train and test the proposed model, a LiDAR point cloud dataset was created, which includes workers in construction sites with 3D box annotations. The experiment results indicate that the proposed model outperforms the baseline models with higher mean average precision and smaller regression errors. The method in the study is promising to provide worker detection with rich and accurate 3D information required by construction automation.
Publisher: Wiley-Blackwell Publishing, Inc.
Journal: Computer-aided civil and infrastructure engineering 
ISSN: 1093-9687
EISSN: 1467-8667
DOI: 10.1111/mice.13238
Rights: © 2024 The Author(s). Computer-Aided Civil and Infrastructure Engineering published by Wiley Periodicals LLC on behalf of Editor.
This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits use and distribution in any medium,provided the original work is properly cited, the use is non-commercial and no modifications or adaptations are made.
The following publication Zhang, M., Wang, L., Han, S., Wang, S., & Li, H. (2024). Deep learning framework with Local Sparse Transformer for construction worker detection in 3D with LiDAR. Computer-Aided Civil and Infrastructure Engineering, 39, 2990–3007 is available at https://doi.org/10.1111/mice.13238.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Zhang_Deep_Learning_Framework.pdf4.21 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

13
Citations as of Oct 13, 2024

Downloads

6
Citations as of Oct 13, 2024

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.