Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/112916
| Title: | Robust style injection for person image synthesis | Authors: | Huang, Y; Qian, J; Zhu, S; Li, J; Yang, J |
Issue Date: | Apr-2025 | Source: | CAAI transactions on intelligence technology, Apr. 2025, v. 10, no. 2, p. 402-414 | Abstract: | Person image synthesis has been widely used in fashion with extensive application scenarios. The task is to synthesise a person image from a single source image under arbitrary poses. Prior methods generate the person image in the target pose well; however, they fail to preserve the fine style details of the source image. To address this problem, a robust style injection (RSI) model is proposed, which is a coarse-to-fine framework to synthesise the target person image. RSI develops a simple and efficient cross-attention based module to fuse the features of both source semantic styles and target pose to obtain coarsely aligned features. Adaptive instance normalisation is employed to enhance the aligned features in conjunction with source semantic styles. Subsequently, source semantic styles are further injected into the positional normalisation scheme to avoid the erosion of fine style details caused by massive convolution. In the training losses, optimal transport theory in the form of energy distance is introduced to constrain the data distribution and refine the texture style details. Additionally, the authors’ model is capable of editing the shape and texture of garments to the target style separately. The experiments demonstrate that the authors’ RSI achieves better performance than the state-of-the-art methods. | Keywords: | Computer vision; Image reconstruction; Virtual try‐on |
Publisher: | John Wiley & Sons Ltd | Journal: | CAAI transactions on intelligence technology | EISSN: | 2468-2322 | DOI: | 10.1049/cit2.12361 | Rights: | © 2024 The Author(s). CAAI Transactions on Intelligence Technology published by John Wiley & Sons Ltd on behalf of The Institution of Engineering and Technology and Chongqing University of Technology. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. The following publication Huang, Y., et al.: Robust style injection for person image synthesis. CAAI Trans. Intell. Technol. 10(2), 402–414 (2025) is available at https://dx.doi.org/10.1049/cit2.12361. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Huang_Robust_Style_Injection.pdf | | 5.09 MB | Adobe PDF | View/Open |