Timezone: »

Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
Thomas Hartvigsen · Swami Sankaranarayanan · Hamid Palangi · Yoon Kim · Marzyeh Ghassemi

Fri Dec 02 12:25 PM -- 12:35 PM (PST) @
Event URL: https://openreview.net/forum?id=xupL1Q0ft- »

Large language models often err during deployment due to non-representative training data or distribution shift in the test set. Recently, model editors have been proposed to fix errors by adjusting a pre-trained model's weights. However, these approaches quickly decay a model's performance on upstream data, and forget how to fix previous errors. We propose and study a novel Lifelong Model Editing setting, where errors stream into a deployed model and we update the model to correct its predictions without influencing it for unrelated inputs. We propose General Retrieval Adaptors for Continual Editing, or GRACE, which learns and caches a particular layer's activations in a codebook as edits stream in, while the original model weights remain frozen. This ensures similar edits are treated similarly without altering the model's performance on unrelated instances. Experimentally, we show that GRACE substantially improves over recent model editors.

Author Information

Thomas Hartvigsen (Massachusetts Institute of Technology)
Swami Sankaranarayanan (Massachusetts Institute of Technology)
Hamid Palangi (Microsoft Research)
Yoon Kim (Massachusetts Institute of Technology)
Marzyeh Ghassemi (MIT)

Related Events (a corresponding poster, oral, or spotlight)

More from the Same Authors