ProTEA: A Programmable FPGA Accelerator for Efficient Transformer Encoder Inference
ProTEA is a runtime programmable FPGA accelerator designed to efficiently execute the computationally intensive multi-head attention and feedforward neural network layers of transformer encoder models.