Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/118683
PIRA download icon_1.1View/Download Full Text
Title: Collaboratively semantic alignment and metric learning for cross-modal hashing
Authors: Li, J
Wong, WK 
Jiang, L
Jiang, K 
Fang, X
Xie, S
Wen, J
Issue Date: May-2025
Source: IEEE transactions on knowledge and data engineering, May 2025, v. 37, no. 5, p. 2311-2328
Abstract: Cross-modal retrieval is a promising technique nowadays to find semantically similar instances in other modalities while a query instance is given from one modality. However, there still exists many challenges for reducing heterogeneous modality gap by embedding label information to discrete hash codes effectively, solving the binary optimization when generating unified hash codes and reducing the discrepancy of data distribution efficiently during common space learning. In order to overcome the above-mentioned challenges, we propose a Collaboratively Semantic alignment and Metric learning for cross-modal Hashing (CSMH) in this paper. Specifically, by a kernelization operation, CSMH first extracts the non-linear data features for each modality, which are projected into a latent subspace to align both marginal and conditional distributions simultaneously. Then, a maximum mean discrepancy-based metric strategy is customized to mitigate the distribution discrepancies among features from different modalities. Finally, semantic information obtained from the label similarity matrix, is further incorporated to embed the latent semantic structure into the discriminant subspace. Experimental results of CSMH and baseline methods on four widely-used datasets show that CSMH outperforms some state-of-the-art hashing baseline methods for cross-modal retrieval on efficiency and precision.
Keywords: Cross-modal hashing
Information retrieval
Maximum mean discrepancy
Metric learning
Semantic alignment
Publisher: Institute of Electrical and Electronics Engineers
Journal: IEEE transactions on knowledge and data engineering 
ISSN: 1041-4347
EISSN: 1558-2191
DOI: 10.1109/TKDE.2025.3537704
Rights: © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication J. Li et al., 'Collaboratively Semantic Alignment and Metric Learning for Cross-Modal Hashing,' in IEEE Transactions on Knowledge and Data Engineering, vol. 37, no. 5, pp. 2311-2328, May 2025 is available at https://doi.org/10.1109/TKDE.2025.3537704.
Appears in Collections:Journal/Magazine Article

Files in This Item:
File Description SizeFormat 
Li_Collaboratively_Semantic_Alignment.pdfPre-Published version7.27 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.