Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105456
PIRA download icon_1.1View/Download Full Text
Title: High-resolution photorealistic image translation in real-time : a laplacian pyramid translation network
Authors: Liang, J 
Zeng, H 
Zhang, L 
Issue Date: 2021
Source: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021, p. 9387-9395
Abstract: Existing image-to-image translation (I2IT) methods are either constrained to low-resolution images or long inference time due to their heavy computational burden on the convolution of high-resolution feature maps. In this paper, we focus on speeding-up the high-resolution photorealistic I2IT tasks based on closed-form Laplacian pyramid decomposition and reconstruction. Specifically, we reveal that the attribute transformations, such as illumination and color manipulation, relate more to the low-frequency component, while the content details can be adaptively refined on high-frequency components. We consequently propose a Laplacian Pyramid Translation Network (LPTN) to simultaneously perform these two tasks, where we design a lightweight network for translating the low-frequency component with reduced resolution and a progressive masking strategy to efficiently refine the high-frequency ones. Our model avoids most of the heavy computation consumed by processing high-resolution feature maps and faithfully preserves the image details. Extensive experimental results on various tasks demonstrate that the proposed method can translate 4K images in real-time using one normal GPU while achieving comparable transformation performance against existing methods. Datasets and codes are available: https://github.com/csjliang/LPTN.
Publisher: Institute of Electrical and Electronics Engineers
ISBN: 978-1-6654-4509-2 (Electronic)
978-1-6654-4510-8 (Print on Demand(PoD))
DOI: 10.1109/CVPR46437.2021.00927
Rights: ©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication J. Liang, H. Zeng and L. Zhang, "High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network," 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 2021, pp. 9387-9395 is available at https://doi.org/10.1109/CVPR46437.2021.00927.
Appears in Collections:Conference Paper

Files in This Item:
File Description SizeFormat 
Liang_High-Resolution_Photorealistic_Image.pdfPre-Published version3.34 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show full item record

Page views

78
Citations as of May 11, 2025

Downloads

50
Citations as of May 11, 2025

SCOPUSTM   
Citations

108
Citations as of Jun 19, 2025

WEB OF SCIENCETM
Citations

92
Citations as of Jun 5, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.