Spiking-leaf : a learnable auditory front-end for spiking neural networks

Song, Z; Wu, J; Zhang, M; Shou, MZ; Li, H

doi:10.1109/ICASSP48485.2024.10446789

Please use this identifier to cite or link to this item: http://hdl.handle.net/10397/107478

Title:	Spiking-leaf : a learnable auditory front-end for spiking neural networks
Authors:	Song, Z Wu, J Zhang, M Shou, MZ Li, H
Issue Date:	2024
Source:	2024 IEEE International Conference on Acoustics, Speech,and Signal Processing : Proceedings : 14-19 April 2024, COEX, Seoul, Korea, p. 226-230
Abstract:	Brain-inspired spiking neural networks (SNNs) have demonstrated great potential for temporal signal processing. However, their performance in speech processing remains limited due to the lack of an effective auditory front-end. To address this limitation, we introduce Spiking-LEAF, a learnable auditory front-end meticulously designed for SNN-based speech processing. Spiking-LEAF combines a learnable filter bank with a novel two-compartment spiking neuron model called IHC-LIF. The IHC-LIF neurons draw inspiration from the structure of inner hair cells (IHC) and they leverage segregated dendritic and somatic compartments to effectively capture multi-scale temporal dynamics of speech signals. Additionally, the IHC-LIF neurons incorporate the lateral feedback mechanism along with spike regularization loss to enhance spike encoding efficiency. On keyword spotting and speaker identification tasks, the proposed Spiking-LEAF outperforms both SOTA spiking auditory front-ends and conventional real-valued acoustic features in terms of classification accuracy, noise robustness, and encoding efficiency.
Keywords:	Learnable audio front-end Speech recognition Spike encoding Spiking neural networks
Publisher:	Institute of Electrical and Electronics Engineers
ISBN:	979-8-3503-4485-1
DOI:	10.1109/ICASSP48485.2024.10446789
Rights:	© 2024 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The following publication Z. Song, J. Wu, M. Zhang, M. Z. Shou and H. Li, "Spiking-Leaf: A Learnable Auditory Front-End for Spiking Neural Networks," ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, Korea, Republic of, 2024, pp. 226-230 is available at https://doi.org/10.1109/ICASSP48485.2024.10446789.
Appears in Collections:	Conference Paper

Files in This Item:

File	Description	Size	Format
Song_Spiking-leaf_Learnable_Auditory.pdf	Preprint version	1.19 MB	Adobe PDF	View/Open

Open Access Information

Status	open access
File Version	Author’s Original

Access

View full-text via PolyU eLinks

Show full item record

Page views

4

Citations as of Jun 30, 2024

Google Scholar^TM

Check

Files in This Item:

Open Access Information

Access

Page views

Google ScholarTM

Altmetric

Google Scholar^TM