Core Concepts
Variational quantum transformer architecture leveraging self-attention through a kernel-based approach.
Abstract
Abstract:
Introduction to the SASQuaTCh model, a novel variational quantum transformer.
Application of self-attention mechanism in quantum circuits using Fourier transform.
Introduction and Related Work:
Overview of quantum computing applications and focus on noisy intermediate scale quantum (NISQ) devices.
Emphasis on quantum machine learning combining properties of quantum theory with machine learning algorithms.
Classical Self-Attention and Multi-Head Attention Networks:
Description of the transformer network architecture with self-attention mechanism.
Explanation of multi-head attention for parallel projections into different subspaces.
Kernel Convolution and Visual Attention Networks:
Integration of self-attention mechanism as a kernel integral transform in neural networks.
SASQuaTCh: A Quantum Fourier Vision Transformer Circuit:
Implementation details of the SASQuaTCh model for image classification tasks.
Sequential Quantum Vision Transformers:
Discussion on deep layering within a single quantum circuit for enhanced flexibility and approximation capabilities.
Discussion & Conclusion:
Exploration of future research directions including geometric priors in dataset modeling.
Stats
Speedup achieved by QFT over classical FFT operations is exponential. (Source: arXiv:2403.14753v1)
Quotes
"Attention is all you need." - Vaswani et al., 2017
"An image is worth 16x16 words." - Dosovitskiy et al., 2021
"Robust speech recognition via large-scale weak supervision." - Radford et al., 2023
"Transformers for modeling physical systems." - Geneva and Zabaras, 2022
"A fast quantum mechanical algorithm for database search." - Grover, 1996
"Quantum embeddings for machine learning." - Lloyd et al., 2020
"Fourier neural operator for parametric partial differential equations." - Li et al., 2020
"Adaptive fourier neural operators: Efficient token mixers for transformers." - Guibas et al., 2021
"Fourcastnet: A global data-driven high-resolution weather model using adaptive fourier neural operators." - Pathak et al., 2022
"Circuit-centric quantum classifiers." - Schuld et al., 2020