Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/113676
DC Field | Value | Language |
---|---|---|
dc.contributor | Department of Data Science and Artificial Intelligence | - |
dc.contributor | Department of Computing | - |
dc.creator | Hao, X | - |
dc.creator | Ma, C | - |
dc.creator | Yang, Q | - |
dc.creator | Wu, J | - |
dc.creator | Tan, KC | - |
dc.date.accessioned | 2025-06-17T07:40:49Z | - |
dc.date.available | 2025-06-17T07:40:49Z | - |
dc.identifier.issn | 2162-237X | - |
dc.identifier.uri | http://hdl.handle.net/10397/113676 | - |
dc.language.iso | en | en_US |
dc.publisher | Institute of Electrical and Electronics Engineers | en_US |
dc.subject | Neuromorphic computing | en_US |
dc.subject | Neuromorphic speech processing | en_US |
dc.subject | Speech enhancement (SE) | en_US |
dc.subject | Spiking neural network (SNN) | en_US |
dc.title | Toward ultralow-power neuromorphic speech enhancement with Spiking-FullSubNet | en_US |
dc.type | Journal/Magazine Article | en_US |
dc.description.otherinformation | Title on author's file: Towards Ultra-Low-Power Neuromorphic Speech Enhancement with Spiking-FullSubNet | en_US |
dc.identifier.doi | 10.1109/TNNLS.2025.3566021 | - |
dcterms.abstract | Speech enhancement (SE) is critical for improving speech intelligibility and quality in various audio devices. In recent years, deep learning-based methods have significantly improved SE performance, but they often come with a high computational cost, which is prohibitive for a large number of edge devices, such as headsets and hearing aids. This work proposes an ultralow-power SE system based on the brain-inspired spiking neural network (SNN) called Spiking-FullSubNet. Spiking-FullSubNet follows a full-band and subband fusioned approach to effectively capture both global and local spectral information. To enhance the efficiency of computationally expensive subband modeling, we introduce a frequency partitioning method inspired by the sensitivity profile of the human peripheral auditory system. Furthermore, we introduce a novel spiking neuron model that can dynamically control the input information integration and forgetting, enhancing the multiscale temporal processing capability of SNN, which is critical for speech denoising. Experiments conducted on the recent Intel Neuromorphic Deep Noise Suppression (N-DNS) Challenge dataset show that the Spiking-FullSubNet surpasses state-of-the-art (SOTA) methods by large margins in terms of both speech quality and energy efficiency metrics. Notably, our system won the championship of the Intel N-DNS Challenge (algorithmic track), opening up a myriad of opportunities for ultralow-power SE at the edge. Our source code and model checkpoints are publicly available at github.com/haoxiangsnr/spiking-fullsubnet | - |
dcterms.accessRights | embargoed access | en_US |
dcterms.bibliographicCitation | IEEE transactions on neural networks and learning systems, Date of Publication: 15 May 2025, Early Access, https://doi.org/10.1109/TNNLS.2025.3566021 | - |
dcterms.isPartOf | IEEE transactions on neural networks and learning systems | - |
dcterms.issued | 2025 | - |
dc.identifier.scopus | 2-s2.0-105005554696 | - |
dc.identifier.eissn | 2162-2388 | - |
dc.description.validate | 202506 bcch | - |
dc.identifier.FolderNumber | a3717b | en_US |
dc.identifier.SubFormID | 50839 | en_US |
dc.description.fundingSource | RGC | en_US |
dc.description.fundingSource | Others | en_US |
dc.description.fundingText | National Natural Science Foundation of China | en_US |
dc.description.pubStatus | Early release | en_US |
dc.date.embargo | 0000-00-00 (to be updated) | en_US |
dc.description.oaCategory | Green (AAM) | en_US |
Appears in Collections: | Journal/Magazine Article |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.