Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/97568
Title: | Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks | Authors: | Liu, P Chi, HL Li, X Guo, J |
Issue Date: | Dec-2021 | Source: | Automation in construction, Dec. 2021, v. 132, 103901 | Abstract: | Fatigue of operators due to intensive workloads and long working time is a significant constraint that leads to inefficient crane operations and increased risk of safety issues. It can be potentially prevented through early warnings of fatigue for further appropriate work shift arrangements. Many deep neural networks have recently been developed for the fatigue detection of vehicle drivers through training and processing the facial image or video data from the public driver's datasets. However, these datasets are difficult to directly use for the fatigue detections under crane operation scenarios due to the variations of facial features and head movement patterns between crane operators and vehicle drivers. Furthermore, there is no representative and public dataset with the facial information of crane operators under construction scenarios. Therefore, this study aims to explore and analyse the features of multi-sources datasets and the corresponding data acquisition methods which are suitable for crane operators' fatigue detection, further providing collection guidelines of crane operators dataset. Variations on public datasets such as real or pretend facial expression, the segment level of human-verified labelling, camera positions, acquisition scenarios, and illumination conditions are analysed. A hybrid learning architecture is proposed by combining convolutional neural networks (CNN) and long short-term memory (LSTM) for fatigue detection. In order to establish a unified evaluation criterion, the effort of the study includes relabelling three public vehicle drivers datasets, NTHU-DDD, UTA-RLDD, and YawnDD, with human-verified labels at the frame and minute segment levels, and training the corresponding hybrid fatigue detection models accordingly. The average detection accuracies and losses are identified for the trained models of UTA-RLDD, NTHU-DDD, and YawnDD individually. The trained models are used to evaluate the fatigue status of facial videos from licensed crane operators under simulated crane operation scenarios. The results suggest the necessary considerations of different influential factors for establishing a large and public fatigue dataset for crane operators. | Keywords: | Construction safety Convolutional neural network (CNN) Fatigue detection Long short-term memory network (LSTM) Multi-sources datasets Tower crane operator |
Publisher: | Elsevier | Journal: | Automation in construction | ISSN: | 0926-5805 | EISSN: | 1872-7891 | DOI: | 10.1016/j.autcon.2021.103901 | Rights: | © 2021 Elsevier B.V. All rights reserved. © 2021. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/. The following publication Liu, P., Chi, H.-L., Li, X., & Guo, J. (2021). Effects of dataset characteristics on the performance of fatigue detection for crane operators using hybrid deep neural networks. Automation in Construction, 132, 103901 is available at https://dx.doi.org/10.1016/j.autcon.2021.103901. |
Appears in Collections: | Journal/Magazine Article |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Liu_Effects_Dataset_Characteristics.pdf | Pre-Published version | 3.38 MB | Adobe PDF | View/Open |
Page views
83
Citations as of Apr 13, 2025
Downloads
76
Citations as of Apr 13, 2025
SCOPUSTM
Citations
46
Citations as of Apr 24, 2025
WEB OF SCIENCETM
Citations
35
Citations as of Apr 24, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.