Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Foundation Model Interventions

Steering semantic search with interpretable features from sparse autoencoders

Christine Ye ⋅ Charles O'Neill ⋅ John Wu ⋅ Kartheik Iyer

Abstract

Chat is not available.