Skip to yearly menu bar Skip to main content


How to Make LLMs Safer? Detecting and Editing Key Heads in LLMs

Kuan Chu ⋅ Chung-En Sun ⋅ Lily Weng

Abstract

Chat is not available.