Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/105456
PIRA download icon_1.1View/Download Full Text
DC FieldValueLanguage
dc.contributorDepartment of Computing-
dc.creatorLiang, Jen_US
dc.creatorZeng, Hen_US
dc.creatorZhang, Len_US
dc.date.accessioned2024-04-15T07:34:29Z-
dc.date.available2024-04-15T07:34:29Z-
dc.identifier.isbn978-1-6654-4509-2 (Electronic)en_US
dc.identifier.isbn978-1-6654-4510-8 (Print on Demand(PoD))en_US
dc.identifier.urihttp://hdl.handle.net/10397/105456-
dc.language.isoenen_US
dc.publisherInstitute of Electrical and Electronics Engineersen_US
dc.rights©2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.en_US
dc.rightsThe following publication J. Liang, H. Zeng and L. Zhang, "High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network," 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, TN, USA, 2021, pp. 9387-9395 is available at https://doi.org/10.1109/CVPR46437.2021.00927.en_US
dc.titleHigh-resolution photorealistic image translation in real-time : a laplacian pyramid translation networken_US
dc.typeConference Paperen_US
dc.identifier.spage9387en_US
dc.identifier.epage9395en_US
dc.identifier.doi10.1109/CVPR46437.2021.00927en_US
dcterms.abstractExisting image-to-image translation (I2IT) methods are either constrained to low-resolution images or long inference time due to their heavy computational burden on the convolution of high-resolution feature maps. In this paper, we focus on speeding-up the high-resolution photorealistic I2IT tasks based on closed-form Laplacian pyramid decomposition and reconstruction. Specifically, we reveal that the attribute transformations, such as illumination and color manipulation, relate more to the low-frequency component, while the content details can be adaptively refined on high-frequency components. We consequently propose a Laplacian Pyramid Translation Network (LPTN) to simultaneously perform these two tasks, where we design a lightweight network for translating the low-frequency component with reduced resolution and a progressive masking strategy to efficiently refine the high-frequency ones. Our model avoids most of the heavy computation consumed by processing high-resolution feature maps and faithfully preserves the image details. Extensive experimental results on various tasks demonstrate that the proposed method can translate 4K images in real-time using one normal GPU while achieving comparable transformation performance against existing methods. Datasets and codes are available: https://github.com/csjliang/LPTN.-
dcterms.accessRightsopen accessen_US
dcterms.bibliographicCitation2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Virtual, 19-25 June 2021, p. 9387-9395en_US
dcterms.issued2021-
dc.identifier.scopus2-s2.0-85122718465-
dc.relation.conferenceIEEE/CVF Conference on Computer Vision and Pattern Recognition [CVPR]-
dc.description.validate202402 bcch-
dc.description.oaAccepted Manuscripten_US
dc.identifier.FolderNumberCOMP-0038-
dc.description.fundingSourceRGCen_US
dc.description.pubStatusPublisheden_US
dc.identifier.OPUS56309612-
dc.description.oaCategoryGreen (AAM)en_US
Appears in Collections:Conference Paper
Files in This Item:
File Description SizeFormat 
Liang_High-Resolution_Photorealistic_Image.pdfPre-Published version3.34 MBAdobe PDFView/Open
Open Access Information
Status open access
File Version Final Accepted Manuscript
Access
View full-text via PolyU eLinks SFX Query
Show simple item record

Page views

78
Citations as of May 11, 2025

Downloads

50
Citations as of May 11, 2025

SCOPUSTM   
Citations

108
Citations as of Jun 19, 2025

WEB OF SCIENCETM
Citations

92
Citations as of Jun 5, 2025

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.