SCAR: Sparse Conditioned Autoencoders for Concept Detection and Steering in LLMs

Ruben Härle · Felix Friedrich · Manuel Brack · Björn Deiseroth · Patrick Schramowski · Kristian Kersting

Abstract
