Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/109960
PIRA download icon_1.1View/Download Full Text
Title: Self-supervised multi-task learning framework for safety and health-oriented road environment surveillance based on connected vehicle visual perception
Authors: Jia, S 
Yao, W 
Issue Date: Apr-2024
Source: International journal of applied earth observation and geoinformation, Apr. 2024, v. 128, 103753
Abstract: Cutting-edge connected vehicle (CV) technologies have drawn much attention in recent years. The real-time traffic data captured by a CV can be shared with other CVs and data centers so as to open new possibilities for solving diverse transportation problems. The trajectory data of CVs have been well-studied and widely used. However, image data captured by onboard cameras in a connected environment, as being a kind of fundamental data source, are not sufficiently investigated, especially for safety and health-oriented visual perception. In this paper, a bidirectional process of image synthesis and decomposition (BPISD) approach is proposed, and thus a novel self-supervised multi-task learning framework, to simultaneously estimate depth map, atmospheric visibility, airlight, and PM2.5 mass concentration, in which depth map and visibility are considered highly associated with traffic safety, while airlight and PM2.5 mass concentration are directly correlated with human health. Both the training and testing phases of the proposed system solely require a single image as input. Due to the innovative training pipeline, the depth estimation network can automatically manage various levels of visibility conditions and overcome diverse inherent problems in current image-synthesis-based self-supervised depth estimation, thereby generating high-quality depth maps even in low-visibility situations and further benefiting accurate estimations of visibility, airlight, and PM2.5 mass concentration. Extensive experiments on the original and synthesized data from the KITTI dataset and real-world data collected in Beijing demonstrate that the proposed method can (1) achieve performance comparable in self-supervised depth estimation as compared with other state-of-the-art methods when taking clear images as input; (2) predict vivid depth map for images contaminated by various levels of haze when the network trained with previous framework fails; and (3) accurately estimate visibility, airlight, and PM2.5 mass concentrations. Beneficial applications can be developed based on the presented work to contribute to high-precise and dynamic geoinformation reconstruction, transportation, meteorology, and smart city.
Graphical abstract: [Figure not available: see fulltext.]
Keywords: Airlight estimation
Bidirectional process of image synthesis and decomposition (BPISD)
Depth estimation
PM2.5 mass concentration estimation
Self-supervised learning
Visibility estimation
Publisher: Elsevier BV
Journal: International journal of applied earth observation and geoinformation 
ISSN: 1569-8432
EISSN: 1872-826X
DOI: 10.1016/j.jag.2024.103753
Rights: © 2024 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/bync-nd/4.0/).
The following publication Jia, S., & Yao, W. (2024). Self-supervised multi-task learning framework for safety and health-oriented road environment surveillance based on connected vehicle visual perception. International Journal of Applied Earth Observation and Geoinformation, 128, 103753 is available at https://doi.org/10.1016/j.jag.2024.103753.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
1-s2.0-S1569843224001079-main.pdf4.1 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.