Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/106880
PIRA download icon_1.1View/Download Full Text
Title: Attention-based cross-modality interaction for multispectral pedestrian detection
Authors: Liu, T 
Zhao, R 
Lam, KM 
Issue Date: 2021
Source: Proceedings of SPIE : the International Society for Optical Engineering, 2021, v. 11766, 1176603
Abstract: Multispectral pedestrian detection has attracted extensive attention, as paired RGB-thermal images can provide complementary patterns to deal with illumination changes in realistic scenarios. However, most of the existing deep-learning-based multispectral detectors extract features from RGB and thermal inputs separately, and fuse them by a simple concatenation operation. This fusion strategy is suboptimal, as undifferentiated concatenation for each region and feature channel may hamper the optimal selection of complementary features from different modalities. To address this limitation, in this paper, we propose an attention-based cross-modality interaction (ACI) module, which aims to adaptively highlight and aggregate the discriminative regions and channels of the feature maps from RGB and thermal images. The proposed ACI module is deployed into multiple layers of a two-branch-based deep architecture, to capture the cross-modal interactions from diverse semantic levels, for illumination-invariant pedestrian detection. Experimental results on the public KAIST multispectral pedestrian benchmark show that the proposed method achieves state-of-the-art detection performance.
Publisher: SPIE - International Society for Optical Engineering
Journal: Proceedings of SPIE : the International Society for Optical Engineering 
ISBN: 978-1-5106-4364-2
978-1-5106-4365-9 (electronic)
ISSN: 0277-786X
EISSN: 1996-756X
DOI: 10.1117/12.2590661
Description: International Workshop on Advanced Imaging Technology 2021 (IWAIT 2021), 2021, Online Only
Rights: © (2021) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this publication for a fee or for commercial purposes, and modification of the contents of the publication are prohibited.
The following publication Tianshan Liu, Rui Zhao, and Kin-Man Lam "Attention-based cross-modality interaction for multispectral pedestrian detection", Proc. SPIE 11766, International Workshop on Advanced Imaging Technology (IWAIT) 2021, 1176603 (13 March 2021) is available at https://doi.org/10.1117/12.2590661.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
Liu_Attention-Based_Cross-Modality_Interaction.pdfPre-Published version1.01 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

96
Last Week
3
Last month
Citations as of Dec 21, 2025

Downloads

71
Citations as of Dec 21, 2025

SCOPUSTM   
Citations

3
Citations as of Dec 19, 2025

WEB OF SCIENCETM
Citations

3
Citations as of Dec 18, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.