Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Foundation Model Interventions

Analysing the Residual Stream of Language Models Under Knowledge Conflicts

Yu Zhao ⋅ Xiaotang Du ⋅ Giwon Hong ⋅ Aryo Gema ⋅ Alessio Devoto ⋅ Hongru WANG ⋅ Xuanli He ⋅ Kam-Fai Wong ⋅ Pasquale Minervini
Keywords: Knowledge Conflict

Abstract

Chat is not available.