HAAT : hybrid attention aggregation transformer for image super-resolution

Lai, SJ; Cheung, TH; Fung, KC; Xue, KW; Lam, KM

doi:10.1117/12.3058003

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/114623

Title:	HAAT : hybrid attention aggregation transformer for image super-resolution
Authors:	Lai, SJ Cheung, TH Fung, KC Xue, KW Lam, KM
Issue Date:	2025
Source:	Proceedings of SPIE : the International Society for Optical Engineering, 2025, v. 13510, 135101A
Abstract:	In the research area of image super-resolution, Swin-transformer-based models are favored for their global spatial modeling and shifting window attention mechanism. However, existing methods often limit self-attention to nonoverlapping windows to cut costs and ignore the useful information that exists across channels. To address this issue, this paper introduces a novel model, the Hybrid Attention Aggregation Transformer (HAAT), designed to better leverage feature information. HAAT is constructed by integrating Swin-Dense-Residual-Connected Blocks (SDRCB) with Hybrid Grid Attention Blocks (HGAB). SDRCB expands the receptive field while maintaining a streamlined architecture, resulting in enhanced performance. HGAB incorporates channel attention, sparse attention, and window attention to improve nonlocal feature fusion and achieve more visually compelling results. Experimental evaluations demonstrate that HAAT surpasses state-of-the-art methods on benchmark datasets.
Keywords:	Attention mechanism Computer vision Image super-resolution Transformer
Publisher:	SPIE - International Society for Optical Engineering
Journal:	Proceedings of SPIE : the International Society for Optical Engineering
ISSN:	0277-786X
EISSN:	1996-756X
DOI:	10.1117/12.3058003
Description:	International Workshop on Advanced Imaging Technology (IWAIT) 2025, 6-8 January 2025, Douliu City, Taiwan
Rights:	Copyright 2024 Society of Photo‑Optical Instrumentation Engineers (SPIE). One print or electronic copy may be made for personal use only. Systematic reproduction and distribution, duplication of any material in this publication for a fee or for commercial purposes, and modification of the contents of the publication are prohibited. The following publication Song-Jiang Lai, Tsun-Hin Cheung, Ka-Chun Fung, Kai-wen Xue, and Kin-Man Lam "HAAT: hybrid attention aggregation transformer for image super-resolution", Proc. SPIE 13510, International Workshop on Advanced Imaging Technology (IWAIT) 2025, 135101A (5 February 2025) is available at https://doi.org/10.1117/12.3058003.
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
135101A.pdf		911.56 kB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Version of Record

Access

View full-text via PolyU eLinks

Show full item record

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Google ScholarTM

Altmetric

Google Scholar^TM