

Oral in Workshop: Associative Memory & Hopfield Networks in 2023

Retrieving $k$-Nearest Memories with Modern Hopfield Networks

Alexander Davydov · Sean Jaffe · Ambuj K Singh · Francesco Bullo

Fri 15 Dec, 2:00 p.m. PST to 2:10 p.m. PST

Presentation: Associative Memory & Hopfield Networks in 2023
Fri 15 Dec, 6:15 a.m. PST to 3:30 p.m. PST

Abstract:

Modern continuous Hopfield networks (MCHNs) are a variant of Hopfield networks with greater storage capacity and known connections to the attention mechanism in transformers. In this paper, we propose a variant of MCHNs, which we call $k$-Hopfield layers, the first Hopfield-type network that retrieves the $k$ nearest memories to a given input. $k$-Hopfield layers are differentiable and may serve as (i) a soft approach to $k$-nearest neighbors, (ii) an augmented form of memory in deep learning architectures, and (iii) an alternative to multihead attention in transformers. We empirically demonstrate that increasing $k$ aids in correctly reconstructing a corrupted input. We show that replacing multihead attention with a $k$-Hopfield layer yields comparable performance in small vision transformers while requiring fewer parameters.
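The abstract does not spell out the construction, so the sketch below is only a rough illustration of the idea, not the authors' $k$-Hopfield layer. It implements the standard MCHN retrieval step (a softmax over similarity scores, which converges toward the single nearest memory) and a naive extension that returns $k$ retrieved patterns by repeating the softmax retrieval while suppressing the memory that dominated the previous round. The function names `mchn_retrieve` and `k_retrieve_sketch`, the inverse temperature `beta`, and the suppression scheme are all illustrative assumptions.

```python
import torch

def mchn_retrieve(X, q, beta=8.0):
    # Standard modern continuous Hopfield update: one step of softmax
    # attention over the stored patterns X (d x N) for a query q (d,).
    p = torch.softmax(beta * (X.T @ q), dim=0)  # (N,) weights over memories
    return X @ p                                # (d,) retrieved pattern

def k_retrieve_sketch(X, q, k=3, beta=8.0):
    # Illustrative assumption, NOT the paper's k-Hopfield layer:
    # repeat the softmax retrieval k times, masking out the memory that
    # dominated the previous round, so each round's soft retrieval
    # concentrates on the next-nearest stored pattern.
    scores = beta * (X.T @ q)                   # (N,) similarity scores
    retrieved = []
    for _ in range(k):
        p = torch.softmax(scores, dim=0)
        retrieved.append(X @ p)
        mask = torch.nn.functional.one_hot(p.argmax(), p.numel()).bool()
        scores = scores.masked_fill(mask, float("-inf"))
    return torch.stack(retrieved, dim=1)        # (d, k) retrieved memories

# Toy usage: recover a corrupted memory and its near neighbors.
d, N = 16, 50
X = torch.randn(d, N)                # N stored patterns of dimension d
q = X[:, 0] + 0.1 * torch.randn(d)   # noisy version of stored pattern 0
out = k_retrieve_sketch(X, q, k=3)   # column 0 should be close to X[:, 0]
```

Note that the `argmax` masking above makes the selection of which memory to suppress a hard, non-differentiable choice (gradients flow through the softmax weights only), which is precisely why a fully differentiable $k$-retrieval layer of the kind the abstract describes is the interesting object.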
