EfficientASR: A Lightweight Speech Recognition Model with Reduced Attention Redundancy and Optimized Feedforward Networks
EfficientASR employs Shared Residual Multi-Head Attention (SRMHA) to reduce redundant attention computations and Chunk-Level Feedforward Networks (CFFN) to decrease the number of parameters, resulting in a lightweight and versatile speech recognition model.