Temporal consistency for RGB-Thermal data-based semantic scene understanding

Li, H; Chu, HK; Sun, Y

doi:10.1109/LRA.2024.3458594

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113767

DC Field	Value	Language
dc.contributor	Department of Mechanical Engineering	-
dc.creator	Li, H	-
dc.creator	Chu, HK	-
dc.creator	Sun, Y	-
dc.date.accessioned	2025-06-23T00:57:53Z	-
dc.date.available	2025-06-23T00:57:53Z	-
dc.identifier.uri	http://hdl.handle.net/10397/113767	-
dc.language.iso	en	en_US
dc.publisher	Institute of Electrical and Electronics Engineers	en_US
dc.rights	© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.	en_US
dc.rights	The following publication H. Li, H. K. Chu and Y. Sun, "Temporal Consistency for RGB-Thermal Data-Based Semantic Scene Understanding," in IEEE Robotics and Automation Letters, vol. 9, no. 11, pp. 9757-9764, Nov. 2024 is available at https://doi.org/10.1109/LRA.2024.3458594.	en_US
dc.subject	Autonomous vehicles	en_US
dc.subject	Multi-modal fusion	en_US
dc.subject	RGB-Thermal	en_US
dc.subject	Semantic segmentation	en_US
dc.subject	Temporal consistency	en_US
dc.title	Temporal consistency for RGB-Thermal data-based semantic scene understanding	en_US
dc.type	Journal/Magazine Article	en_US
dc.identifier.spage	9757	-
dc.identifier.epage	9764	-
dc.identifier.volume	9	-
dc.identifier.issue	11	-
dc.identifier.doi	10.1109/LRA.2024.3458594	-
dcterms.abstract	Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic scene understanding performance using only RGB images are usually degraded. Thermal images can provide complementary information to RGB images, so many recent semantic segmentation networks have been proposed using RGB-Thermal (RGB-T) images. However, most existing networks focus only on improving segmentation accuracy for single image frames, omitting the information consistency between consecutive frames. To provide a solution to this issue, we propose a temporal-consistent framework for RGB-T semantic segmentation, which introduces a virtual view image generation module to synthesize a virtual image for the next moment, and a consistency loss function to ensure the segmentation consistency. We also propose an evaluation metric to measure both the accuracy and consistency for semantic segmentation. Experimental results show that our framework outperforms state-of-the-art methods.	-
dcterms.accessRights	open access	en_US
dcterms.bibliographicCitation	IEEE robotics and automation letters, Nov. 2024, v. 9, no. 11, p. 9757-9764	-
dcterms.isPartOf	IEEE robotics and automation letters	-
dcterms.issued	2024-11	-
dc.identifier.scopus	2-s2.0-85204027464	-
dc.identifier.eissn	2377-3766	-
dc.description.validate	202506 bcch	-
dc.description.oa	Accepted Manuscript	en_US
dc.identifier.FolderNumber	a3739	en_US
dc.identifier.SubFormID	50913	en_US
dc.description.fundingSource	Others	en_US
dc.description.fundingText	Innovation and Technology Fund	en_US
dc.description.pubStatus	Published	en_US
dc.description.oaCategory	Green (AAM)	en_US
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
Li_Temporal_Consistency_RGB-Thermal.pdf	Pre-Published version	5.47 MB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Final Accepted Manuscript

Access

View full-text via PolyU eLinks

Show simple item record

SCOPUS^TM
Citations

5

Citations as of Nov 21, 2025

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

SCOPUSTM Citations

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

Google Scholar^TM