Wan, R., Zhang, Q., & Karniadakis, G. E. (2024). Randomized Forward Mode Gradient for Spiking Neural Networks in Scientific Machine Learning. arXiv preprint arXiv:2411.07057.
This paper aims to address the limitations of back-propagation in training SNNs, particularly its biological implausibility and incompatibility with neuromorphic hardware. The authors propose and evaluate a novel training method based on randomized forward-mode gradient and weight perturbation as a more biologically plausible and potentially more efficient alternative.
The authors utilize a weight perturbation method within a forward-mode gradient framework to train SNNs. Instead of back-propagating errors, they perturb the weight matrix with small noise and estimate gradients by observing the changes in the network output. They explore two methods for determining surrogate gradients: the standard surrogate gradient and a weak form using Stein's lemma. The approach is evaluated on regression tasks, including function approximation and solving differential equations, using spiking versions of DeepONet and SepONet architectures.
The proposed randomized forward-mode gradient approach achieves competitive accuracy compared to back-propagation on the tested regression tasks. The authors demonstrate the effectiveness of their method in approximating a 2D Mexican hat wavelet, solving a 1D Poisson equation, and simulating a nonlinear reaction-diffusion PDE.
The study demonstrates the viability of randomized forward-mode gradient with weight perturbation as a biologically plausible training method for SNNs. This approach offers potential advantages in terms of biological realism and hardware compatibility, particularly for neuromorphic systems.
This research contributes to the development of more biologically plausible and hardware-efficient training algorithms for SNNs, potentially paving the way for wider adoption of SNNs in scientific machine learning and other domains.
The study primarily focuses on relatively small SNNs and specific regression tasks. Further research is needed to evaluate the scalability and generalizability of the proposed method to larger networks and more complex tasks. Additionally, exploring multi-directional perturbations and implementing the method on neuromorphic hardware like Intel's Loihi-2 are promising directions for future work.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Ruyin Wan, Q... at arxiv.org 11-12-2024
https://arxiv.org/pdf/2411.07057.pdfDeeper Inquiries