Skip to yearly menu bar Skip to main content


Poster

Improving Sparse Decomposition of Language Model Activations with Gated Sparse Autoencoders

Senthooran Rajamanoharan ⋅ Arthur Conmy ⋅ Lewis Smith ⋅ Tom Lieberum ⋅ Vikrant Varma ⋅ Janos Kramar ⋅ Rohin Shah ⋅ Neel Nanda
2024 Poster
[ Paper [ Poster [ OpenReview

Abstract

Video

Chat is not available.