Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/118678
| Title: | Geometric distortion guided transformer for omnidirectional image super-resolution | Authors: | Yang, C Dong, R Xiao, J Zhang, C Lam, KM Zhou, F Qiu, G |
Issue Date: | Jun-2025 | Source: | IEEE transactions on circuits and systems for video technology, June 2025, v. 35, no. 6, p. 5166-5181 | Abstract: | As virtual and augmented reality applications gain popularity, omnidirectional image (ODI) super-resolution has become increasingly important. Unlike 2D plain images that are formed on a plane, ODIs are projected onto spherical surfaces. Applying established image super-resolution methods to ODIs, therefore, requires performing equirectangular projection (ERP) to map the ODIs onto a plane. ODI super-resolution needs to take into account geometric distortion resulting from ERP. However, without considering such geometric distortion of ERP images, previous methods only utilize a limited range of pixels and may easily miss self-similar textures for reconstruction. In this paper, we introduce a novel Geometric Distortion Guided Transformer for Omnidirectional image Super-Resolution (GDGT-OSR). Specifically, a distortion modulated rectangle-window self-attention mechanism, integrated with deformable self-attention, is proposed to better perceive the distortion and thus involve more self-similar textures. Distortion modulation is achieved through a newly devised distortion guidance generator that produces guidance for the rectangular windows by exploiting the variability of distortion across latitudes. Furthermore, we propose a dynamic feature aggregation scheme to adaptively fuse the features from different self-attention modules. We present extensive experimental results on public datasets and show that the new GDGT-OSR outperforms methods in existing literature. | Keywords: | Distortion Omnidirectional image Rectangle-window Super-resolution Transformer |
Publisher: | Institute of Electrical and Electronics Engineers | Journal: | IEEE transactions on circuits and systems for video technology | ISSN: | 1051-8215 | EISSN: | 1558-2205 | DOI: | 10.1109/TCSVT.2025.3525900 | Rights: | © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication C. Yang et al., 'Geometric Distortion Guided Transformer for Omnidirectional Image Super-Resolution,' in IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 6, pp. 5166-5181, June 2025 is available at https://doi.org/10.1109/TCSVT.2025.3525900. |
| Appears in Collections: | Journal/Magazine Article |
Files in This Item:
| File | Description | Size | Format | |
|---|---|---|---|---|
| Yang_Geometric_Distortion_Guided.pdf | Pre-Published version | 5.06 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.



