Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/105456
Title: | High-resolution photorealistic image translation in real-time : a laplacian pyramid translation network | Authors: | Liang, J Zeng, H Zhang, L |
Issue Date: | 2021 | Source: | 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021, p. 9387-9395 | Abstract: | Existing image-to-image translation (I2IT) methods are either constrained to low-resolution images or long inference time due to their heavy computational burden on the convolution of high-resolution feature maps. In this paper, we focus on speeding-up the high-resolution photorealistic I2IT tasks based on closed-form Laplacian pyramid decomposition and reconstruction. Specifically, we reveal that the attribute transformations, such as illumination and color manipulation, relate more to the low-frequency component, while the content details can be adaptively refined on high-frequency components. We consequently propose a Laplacian Pyramid Translation Network (LPTN) to simultaneously perform these two tasks, where we design a lightweight network for translating the low-frequency component with reduced resolution and a progressive masking strategy to efficiently refine the high-frequency ones. Our model avoids most of the heavy computation consumed by processing high-resolution feature maps and faithfully preserves the image details. Extensive experimental results on various tasks demonstrate that the proposed method can translate 4K images in real-time using one normal GPU while achieving comparable transformation performance against existing methods. Datasets and codes are available: https://github.com/csjliang/LPTN. | Publisher: | Institute of Electrical and Electronics Engineers | ISBN: | 978-1-6654-4509-2 (Electronic) 978-1-6654-4510-8 (Print on Demand(PoD)) |
DOI: | 10.1109/CVPR46437.2021.00927 | Rights: | ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication J. Liang, H. Zeng and L. Zhang, "High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network," 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 2021, pp. 9387-9395 is available at https://doi.org/10.1109/CVPR46437.2021.00927. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Liang_High-Resolution_Photorealistic_Image.pdf | Pre-Published version | 3.34 MB | Adobe PDF | View/Open |
Page views
78
Citations as of May 11, 2025
Downloads
50
Citations as of May 11, 2025
SCOPUSTM
Citations
108
Citations as of Jun 19, 2025
WEB OF SCIENCETM
Citations
92
Citations as of Jun 5, 2025

Google ScholarTM
Check
Altmetric
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.