Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/118683
| Title: | Collaboratively semantic alignment and metric learning for cross-modal hashing | Authors: | Li, J Wong, WK Jiang, L Jiang, K Fang, X Xie, S Wen, J |
Issue Date: | May-2025 | Source: | IEEE transactions on knowledge and data engineering, May 2025, v. 37, no. 5, p. 2311-2328 | Abstract: | Cross-modal retrieval is a promising technique nowadays to find semantically similar instances in other modalities while a query instance is given from one modality. However, there still exists many challenges for reducing heterogeneous modality gap by embedding label information to discrete hash codes effectively, solving the binary optimization when generating unified hash codes and reducing the discrepancy of data distribution efficiently during common space learning. In order to overcome the above-mentioned challenges, we propose a Collaboratively Semantic alignment and Metric learning for cross-modal Hashing (CSMH) in this paper. Specifically, by a kernelization operation, CSMH first extracts the non-linear data features for each modality, which are projected into a latent subspace to align both marginal and conditional distributions simultaneously. Then, a maximum mean discrepancy-based metric strategy is customized to mitigate the distribution discrepancies among features from different modalities. Finally, semantic information obtained from the label similarity matrix, is further incorporated to embed the latent semantic structure into the discriminant subspace. Experimental results of CSMH and baseline methods on four widely-used datasets show that CSMH outperforms some state-of-the-art hashing baseline methods for cross-modal retrieval on efficiency and precision. | Keywords: | Cross-modal hashing Information retrieval Maximum mean discrepancy Metric learning Semantic alignment |
Publisher: | Institute of Electrical and Electronics Engineers | Journal: | IEEE transactions on knowledge and data engineering | ISSN: | 1041-4347 | EISSN: | 1558-2191 | DOI: | 10.1109/TKDE.2025.3537704 | Rights: | © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication J. Li et al., 'Collaboratively Semantic Alignment and Metric Learning for Cross-Modal Hashing,' in IEEE Transactions on Knowledge and Data Engineering, vol. 37, no. 5, pp. 2311-2328, May 2025 is available at https://doi.org/10.1109/TKDE.2025.3537704. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Li_Collaboratively_Semantic_Alignment.pdf | Pre-Published version | 7.27 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



