Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113676
DC Field | Value | Language
dc.contributor | Department of Data Science and Artificial Intelligence | -
dc.contributor | Department of Computing | -
dc.creator | Hao, X | -
dc.creator | Ma, C | -
dc.creator | Yang, Q | -
dc.creator | Wu, J | -
dc.creator | Tan, KC | -
dc.date.accessioned | 2025-06-17T07:40:49Z | -
dc.date.available | 2025-06-17T07:40:49Z | -
dc.identifier.issn | 2162-237X | -
dc.identifier.uri | http://hdl.handle.net/10397/113676 | -
dc.language.iso | en | en_US
dc.publisher | Institute of Electrical and Electronics Engineers | en_US
dc.subject | Neuromorphic computing | en_US
dc.subject | Neuromorphic speech processing | en_US
dc.subject | Speech enhancement (SE) | en_US
dc.subject | Spiking neural network (SNN) | en_US
dc.title | Toward ultralow-power neuromorphic speech enhancement with Spiking-FullSubNet | en_US
dc.type | Journal/Magazine Article | en_US
dc.description.otherinformation | Title on author's file: Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet | en_US
dc.identifier.doi | 10.1109/TNNLS.2025.3566021 | -
dcterms.abstract | Speech enhancement (SE) is critical for improving speech intelligibility and quality in various audio devices. In recent years, deep learning-based methods have significantly improved SE performance, but they often come with a high computational cost, which is prohibitive for a large number of edge devices, such as headsets and hearing aids. This work proposes an ultralow-power SE system based on the brain-inspired spiking neural network (SNN) called Spiking-FullSubNet. Spiking-FullSubNet follows a full-band and subband fusion approach to effectively capture both global and local spectral information. To enhance the efficiency of computationally expensive subband modeling, we introduce a frequency partitioning method inspired by the sensitivity profile of the human peripheral auditory system. Furthermore, we introduce a novel spiking neuron model that can dynamically control the input information integration and forgetting, enhancing the multiscale temporal processing capability of the SNN, which is critical for speech denoising. Experiments conducted on the recent Intel Neuromorphic Deep Noise Suppression (N-DNS) Challenge dataset show that Spiking-FullSubNet surpasses state-of-the-art (SOTA) methods by large margins in terms of both speech quality and energy efficiency metrics. Notably, our system won the championship of the Intel N-DNS Challenge (algorithmic track), opening up a myriad of opportunities for ultralow-power SE at the edge. Our source code and model checkpoints are publicly available at github.com/haoxiangsnr/spiking-fullsubnet. | -
dcterms.accessRights | embargoed access | en_US
dcterms.bibliographicCitation | IEEE transactions on neural networks and learning systems, Date of Publication: 15 May 2025, Early Access, https://doi.org/10.1109/TNNLS.2025.3566021 | -
dcterms.isPartOf | IEEE transactions on neural networks and learning systems | -
dcterms.issued | 2025 | -
dc.identifier.scopus | 2-s2.0-105005554696 | -
dc.identifier.eissn | 2162-2388 | -
dc.description.validate | 202506 bcch | -
dc.identifier.FolderNumber | a3717b | en_US
dc.identifier.SubFormID | 50839 | en_US
dc.description.fundingSource | RGC | en_US
dc.description.fundingSource | Others | en_US
dc.description.fundingText | National Natural Science Foundation of China | en_US
dc.description.pubStatus | Early release | en_US
dc.date.embargo | 0000-00-00 (to be updated) | en_US
dc.description.oaCategory | Green (AAM) | en_US
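The abstract above mentions a spiking neuron model that dynamically controls how input information is integrated and forgotten. As a rough illustration of that idea only, the PyTorch sketch below shows a leaky integrate-and-fire neuron whose integration and forgetting factors are computed from the current input instead of being fixed constants. The class name GatedLIFNeuron, the linear gate parameterization, and the soft-reset rule are assumptions made for this sketch, not the formulation published in the paper or the released code (see github.com/haoxiangsnr/spiking-fullsubnet for the actual model).

```python
import torch
import torch.nn as nn


class GatedLIFNeuron(nn.Module):
    """Illustrative leaky integrate-and-fire neuron with input-dependent
    integration and forgetting gates. This is a generic sketch of the idea
    described in the abstract, not the exact Spiking-FullSubNet neuron."""

    def __init__(self, hidden_size: int, threshold: float = 1.0):
        super().__init__()
        self.threshold = threshold
        # Hypothetical parameterization: gates computed from the current input.
        self.integrate_gate = nn.Linear(hidden_size, hidden_size)
        self.forget_gate = nn.Linear(hidden_size, hidden_size)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, time, hidden_size) input current sequence.
        batch, time, hidden = x.shape
        v = x.new_zeros(batch, hidden)  # membrane potential
        spikes = []
        for t in range(time):
            x_t = x[:, t]
            alpha = torch.sigmoid(self.forget_gate(x_t))    # how much old state to keep
            beta = torch.sigmoid(self.integrate_gate(x_t))  # how much new input to integrate
            v = alpha * v + beta * x_t
            s = (v >= self.threshold).float()  # hard threshold; training would
                                               # need a surrogate gradient here
            v = v - s * self.threshold         # soft reset after a spike
            spikes.append(s)
        return torch.stack(spikes, dim=1)      # (batch, time, hidden_size)
```

Making the decay (alpha) and integration (beta) input-dependent lets the neuron hold information over long spans for some inputs and forget quickly for others, which is one plausible reading of the multiscale temporal processing claim in the abstract.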
Appears in Collections: Journal/Magazine Article