Skip to yearly menu bar Skip to main content


Poster

Aligning Large Language Models with Representation Editing: A Control Perspective

Lingkai Kong ⋅ Haorui Wang ⋅ Wenhao Mu ⋅ Yuanqi Du ⋅ Yuchen Zhuang ⋅ Yifei Zhou ⋅ Yue Song ⋅ Rongzhi Zhang ⋅ Kai Wang ⋅ Chao Zhang
2024 Poster

Abstract

Video

Chat is not available.