Please use this identifier to cite or link to this item:
http://hdl.handle.net/10397/113673
Title: | Global-local convolution with spiking neural networks for energy-efficient keyword spotting | Authors: | Wang, S Zhang, D Shi, K Wang, Y Wei, W Wu, J Zhang, M |
Issue Date: | 2024 | Source: | 25th Annual Conference of the International Speech Communication Association (INTERSPEECH 2024), Kos, Greece, 1-5 September 2024, p. 4523-4527 | Abstract: | Thanks to Deep Neural Networks (DNNs), the accuracy of Keyword Spotting (KWS) has made substantial progress. However, as KWS systems are usually implemented on edge devices, energy efficiency becomes a critical requirement besides performance. Here, we take advantage of spiking neural networks' energy efficiency and propose an end-to-end lightweight KWS model. The model consists of two innovative modules: 1) Global-Local Spiking Convolution (GLSC) module and 2) Bottleneck-PLIF module. Compared to the hand-crafted feature extraction methods, the GLSC module achieves speech feature extraction that is sparser, more energy-efficient, and yields better performance. The Bottleneck-PLIF module further processes the signals from GLSC with the aim to achieve higher accuracy with fewer parameters. Extensive experiments are conducted on the Google Speech Commands Dataset (V1 and V2). The results show our method achieves competitive performance among SNN-based KWS models with fewer parameters. | Keywords: | Global-Local spiking convolution Keyword spotting Spiking neural networks |
Publisher: | International Speech Communication Association | ISBN: | 979-8-3313-0506-2 | DOI: | 10.21437/Interspeech.2024-642 | Description: | 25th Annual Conference of the International Speech Communication Association, Interspeech 2024, Kos, Greece, September 1-5, 2024. ISCA 2024 | Rights: | Copyright © (2024) by International Speech Communication Association All rights reserved. The following publication Wang, S., Zhang, D., Shi, K., Wang, Y., Wei, W., Wu, J., Zhang, M. (2024) Global-Local Convolution with Spiking Neural Networks for Energy-efficient Keyword Spotting. Proc. Interspeech 2024, 4523-4527 is available at https://doi.org/10.21437/Interspeech.2024-642. |
Appears in Collections: | Conference Paper |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
wang24p_interspeech.pdf | 3.06 MB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.