Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/101453
| DC Field | Value | Language |
|---|---|---|
| dc.contributor | Department of Computing | - |
| dc.contributor | Mainland Development Office | - |
| dc.creator | Lu, X | en_US |
| dc.creator | Guo, S | en_US |
| dc.creator | Liu, Z | en_US |
| dc.creator | Guo, J | en_US |
| dc.date.accessioned | 2023-09-18T02:26:37Z | - |
| dc.date.available | 2023-09-18T02:26:37Z | - |
| dc.identifier.isbn | 979-8-3503-0129-8 (Electronic) | en_US |
| dc.identifier.isbn | 979-8-3503-0130-4 (Print on Demand(PoD)) | en_US |
| dc.identifier.uri | http://hdl.handle.net/10397/101453 | - |
| dc.language.iso | en | en_US |
| dc.publisher | IEEE | en_US |
| dc.rights | ©2023 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | en_US |
| dc.rights | The following publication X. Lu, S. Guo, Z. Liu and J. Guo, "Decomposed Soft Prompt Guided Fusion Enhancing for Compositional Zero-Shot Learning," 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 2023, pp. 23560-23569, is available at https://doi.org/10.1109/CVPR52729.2023.02256. | en_US |
| dc.subject | Low-level vision | en_US |
| dc.title | Decomposed soft prompt guided fusion enhancing for compositional zero-shot learning | en_US |
| dc.type | Conference Paper | en_US |
| dc.identifier.spage | 23560 | en_US |
| dc.identifier.epage | 23569 | en_US |
| dc.identifier.doi | 10.1109/CVPR52729.2023.02256 | en_US |
| dcterms.abstract | Compositional Zero-Shot Learning (CZSL) aims to recognize novel concepts formed by known states and objects during training. Existing methods either learn the combined state-object representation, challenging the generalization of unseen compositions, or design two classifiers to identify state and object separately from image features, ignoring the intrinsic relationship between them. To jointly eliminate the above issues and construct a more robust CZSL system, we propose a novel framework termed Decomposed Fusion with Soft Prompt (DFSP) 1 1 Code is available at: https://github.corn/Forest-art/DFSP.git, by involving vision-language models (VLMs)for unseen composition recognition. Specifically, DFSP constructs a vector combination of learnable soft prompts with state and object to establish the joint representation of them. In addition, a cross-modal decomposed fusion module is designed between the language and image branches, which decomposes state and object among language features instead of image features. Notably, being fused with the decomposed features, the image features can be more expressive for learning the relationship with states and objects, respectively, to improve the response of unseen compositions in the pair space, hence narrowing the domain gap between seen and unseen sets. Experimental results on three challenging benchmarks demonstrate that our approach significantly outperforms other state-of-the-art methods by large margins. | - |
| dcterms.accessRights | open access | en_US |
| dcterms.bibliographicCitation | 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Vancouver, BC, Canada, 17-24 June 2023, p. 23560-23569 | en_US |
| dcterms.issued | 2023 | - |
| dc.identifier.ros | 2022003122 | - |
| dc.relation.ispartofbook | 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | en_US |
| dc.relation.conference | IEEE/CVF Conference on Computer Vision and Pattern Recognition [CVPR] | - |
| dc.description.validate | 202309 bcww | - |
| dc.description.oa | Accepted Manuscript | en_US |
| dc.identifier.FolderNumber | CDCF_2022-2023 | - |
| dc.description.fundingSource | RGC | en_US |
| dc.description.fundingSource | Others | en_US |
| dc.description.fundingText | Key-Area Research and Development Program of Guangdong Province (No. 2021B0101400003); Areas of Excellence Scheme (AoE/E-601/22-R); General Research Fund (No. 152203/20E, 152244/21E, 152169/22E, 152211/23E); the National Natural Science Foundation of China (No. 62102327); PolyU Internal Fund (No. P0043932); Shenzhen Science and Technology Innovation Commission (JCYJ20200109142008673) | en_US |
| dc.description.pubStatus | Published | en_US |
| dc.description.oaCategory | Green (AAM) | en_US |
| Appears in Collections: | Conference Paper | |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Lu_Decomposed_Soft_Prompt.pdf | Pre-Published version | 2.11 MB | Adobe PDF | View/Open |
Page views
106
Citations as of Apr 14, 2025
Downloads
102
Citations as of Apr 14, 2025
SCOPUSTM
Citations
3
Citations as of Jun 21, 2024
WEB OF SCIENCETM
Citations
3
Citations as of Oct 10, 2024
Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



