Skip to yearly menu bar Skip to main content


Poster
in
Workshop: Foundation Model Interventions

Do LLMs internally ``know'' when they follow instructions?

Juyeon Heo ⋅ Christina Heinze-Deml ⋅ Shirley Ren ⋅ Oussama Elachqar ⋅ Udhyakumar Nallasamy ⋅ Andy Miller ⋅ Jaya Narain

Abstract

Chat is not available.