Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/113672
Title: When audio denoising meets spiking neural network
Authors: Hao, X 
Ma, C 
Yang, Q
Tan, KC 
Wu, J 
Issue Date: 2024
Source: Proceedings : 2024 IEEE Conference on Artificial Intelligence CAI 2024 : 25-27 June 2024, Marina Bay Sands, Singapore, p. 1524-1527
Abstract: Audio denoising techniques are essential tools for enhancing audio quality. Spiking neural networks (SNNs) offer promising opportunities for audio denoising, as they leverage brain-inspired architectures and computational principles to efficiently process and analyze audio signals, enabling real-time denoising with improved accuracy and reduced computational overhead. This paper introduces Spiking-FullSubNet, a real-time audio denoising model based on SNN. Our proposed model incorporates a novel gated spiking neuron model (GSN) to effectively capture multi-scale temporal information, which is crucial for achieving high-fidelity audio denoising. Furthermore, we propose the integration of GSNs within an optimized FullSubNet neural architecture, enabling efficient processing of full-band and sub-band frequencies while significantly reducing computational overhead. Alongside the architectural advancements, we incorporate a metric discriminator-based loss function that selectively enhances the desired performance metrics without compromising others. Empirical evaluations show the superior performance of Spiking-FullSubNet, ranking it as the winner of Track 1 (Algorithmic) of the Intel Neuromorphic Deep Noise Suppression Challenge.
Keywords: Audio signal processing
Neuromorphic computing
Speech denoising
Spiking neural network
Publisher: Institute of Electrical and Electronics Engineers
ISBN: 979-8-3503-5409-6
DOI: 10.1109/CAI59869.2024.00275
Description: 2024 IEEE Conference on Artificial Intelligence CAI 2024 : 25-27 June 2024, Marina Bay Sands, Singapore
Rights: © 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
The following publication X. Hao, C. Ma, Q. Yang, K. C. Tan and J. Wu, "When Audio Denoising Meets Spiking Neural Network," 2024 IEEE Conference on Artificial Intelligence (CAI), Singapore, Singapore, 2024, pp. 1524-1527 is available at https://doi.org/10.1109/CAI59869.2024.00275.
Appears in Collections: Conference Paper

Files in This Item:
File: Hao_When_Audio_Denoising.pdf
Description: Pre-Published version
Size: 1.5 MB
Format: Adobe PDF
Open Access Information
Status: open access
File Version: Final Accepted Manuscript
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.