Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/112916
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorSchool of Fashion and Textiles-
dc.creatorHuang, Y-
dc.creatorQian, J-
dc.creatorZhu, S-
dc.creatorLi, J-
dc.creatorYang, J-
dc.date.accessioned2025-05-15T06:58:59Z-
dc.date.available2025-05-15T06:58:59Z-
dc.identifier.urihttp://hdl.handle.net/10397/112916-
dc.language.isoenen_US
dc.publisherElsevieren_US
dc.rights© 2024 The Author(s). CAAI Transactions on Intelligence Technology published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology and Chongqing University of Technology.en_US
dc.rightsThis is an open access article under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.en_US
dc.rightsThe following publication Huang, Y., et al.: Robust style injection for person image synthesis. CAAI Trans. Intell. Technol. 10(2), 402–414 (2025) is available at https://dx.doi.org/10.1049/cit2.12361.en_US
dc.subjectComputer visionen_US
dc.subjectImage reconstructionen_US
dc.subjectVirtual try‐onen_US
dc.titleRobust style injection for person image synthesisen_US
dc.typeJournal/Magazine Articleen_US
dc.identifier.spage402-
dc.identifier.epage414-
dc.identifier.volume10-
dc.identifier.issue2-
dc.identifier.doi10.1049/cit2.12361-
dcterms.abstractPerson Image Synthesis has been widely used in fashion with extensive application scenarios. The point of this task is how to synthesise person image from a single source image under arbitrary poses. Prior methods generate the person image with target pose well; however, they fail to preserve the fine style details of the source image. To address this problem, a robust style injection (RSI) model is proposed, which is a coarse-to-fine framework to synthesise target the person image. RSI develops a simple and efficient cross-attention based module to fuse the features of both source semantic styles and target pose for achieving the coarse aligned features. The adaptive instance normalisation is employed to enhance the aligned features in conjunction with source semantic styles. Subsequently, source semantic styles are further injected into the positional normalisation scheme to avoid the fine style details erosion caused by massive convolution. In training losses, optimal transport theory in the form of energy distance is introduced to constrain data distribution to refine the texture style details. Additionally, the authors’ model is capable of editing the shape and texture of garments to the target style separately. The experiments demonstrate that the authors’ RSI achieves better performance over the state-of-art methods.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitationCAAI transactions on intelligence technology, Apr. 2024, v. 10, no. 2, p. 402-414-
dcterms.isPartOfCAAI transactions on intelligence technology-
dcterms.issued2025-04-
dc.identifier.scopus2-s2.0-85215416207-
dc.identifier.eissn2468-2322-
dc.description.validate202505 bcrc-
dc.description.oaVersion of Recorden_US
dc.identifier.FolderNumberOA_Scopus/WOSen_US
dc.description.fundingSourceOthersen_US
dc.description.fundingTextNational Science Fund of Chinaen_US
dc.description.pubStatusPublisheden_US
dc.description.oaCategoryCCen_US
Appears in Collections:Journal/Magazine Article
Files in This Item:
File Description SizeFormat 
Huang_Robust_Style_Injection.pdf5.09 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Version of Record
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.