Core Concepts
Spiking-LEAF enhances speech processing with a learnable auditory front-end for spiking neural networks.
Abstract
The content introduces the Spiking-LEAF model, designed to improve speech processing in spiking neural networks. It combines a learnable filter bank with a novel two-compartment spiking neuron model called IHC-LIF. The model outperforms existing auditory front-ends in keyword spotting and speaker identification tasks, showcasing higher accuracy, noise robustness, and encoding efficiency. The paper details the architecture, methods used, experimental results, ablation studies, and conclusions.
Abstract:
Introduces Spiking-LEAF for SNN-based speech processing.
Combines learnable filter bank with IHC-LIF neuron model.
Introduction:
SNNs excel in sequential modeling but lag in speech tasks.
Methods:
Features Gabor 1d-convo filter bank and PCEN for extraction.
Results:
Spiking-LEAF surpasses existing front-ends on KWS and SI tasks.
Ablation Studies:
Learnable features enhance representation power.
Conclusion:
Spiking-LEAF offers improved feature extraction and encoding efficiency.
Quotes
"Spiking LEarnable Audio front-end model, called Spiking-LEAF." - Content
"Our proposed Spiking LEarnable Audio front-end shows high classification accuracy." - Content