The paper investigates the computational limits of modern Hopfield models, a type of associative memory model compatible with deep learning. The key contributions are:
Computational Limits: The authors identify a phase transition behavior on the norm of query and memory patterns, assuming the Strong Exponential Time Hypothesis (SETH). They prove an upper bound criterion B* = Θ(√log τ) for the norms, such that only below this criterion can sub-quadratic (efficient) variants of the modern Hopfield model exist.
Efficient Model: The authors provide an efficient algorithm for the approximate modern Hopfield memory retrieval problem (AHop) based on low-rank approximation. This algorithm achieves nearly linear time complexity τ^(1+o(1)) under realistic settings, where τ = max{M, L} is the upper bound of the pattern lengths.
Exponential Memory Capacity: For the nearly-linear-time modern Hopfield model, the authors derive its retrieval error bound and show that it maintains the exponential memory capacity characteristic of modern Hopfield models, while achieving the improved efficiency.
The paper establishes the computational limits of modern Hopfield models and provides a concrete example of an efficient variant, which is crucial for advancing Hopfield-based large foundation models.
Naar een andere taal
vanuit de broninhoud
arxiv.org
Belangrijkste Inzichten Gedestilleerd Uit
by Jerry Yao-Ch... om arxiv.org 04-08-2024
https://arxiv.org/pdf/2402.04520.pdfDiepere vragen