Poster in Workshop: Foundation Model Interventions

Steering semantic search with interpretable features from sparse autoencoders

Christine Ye · Charles O'Neill · John Wu · Kartheik Iyer

Abstract
