Skip to yearly menu bar Skip to main content


SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Ruben Härle ⋅ Felix Friedrich ⋅ Manuel Brack ⋅ Björn Deiseroth ⋅ Patrick Schramowski ⋅ Kristian Kersting

Abstract

Chat is not available.